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COMPOSITIONS AND METHODS FOR BIODETECTION 
BY NUCLEIC ACID-TEMPLATED CHEMISTRY 



RELATED APPLICATIONS 

[0001] This application claims the benefit of and priority to U.S. Patent Applications Serial 
Nos. 60/847,859, filed September 28, 2006, 60/905,364, filed March 7, 2007; and 60/918,023, 

filed March 14, 2007, and PCT International Application No. PCT/US07/ , titled 

"Receptor Family Profiling," by Landsman et aL, filed September 18, 2007 in the U.S. 
5 Receiving Office, the entire disclosure of each of which is incorporated by reference herein for 
all purposes. 

FIELD OF THE INVENTION 

[0002] The present invention relates generally to probes and their use in biodetection and 
diagnostics. More particularly, the invention relates to compositions and methods for 
10 biodetection using nucleic acid-templated chemistry (e.g., synthesis of compounds having 

desired fluorescent, chemiluminescent or chromophoric properties in a multiplex detection of 
nucleic acids or proteins). 

BACKGROUND 

[0003] Fluorescent and colored compounds have been used in the fields of biological 
1 5 research and medicine to detect the presence, absence, state, quantity, and composition of 

biomolecules. Assays using fluorescent and colored compounds may be performed in vitro, in 
situ, or in vivo. Examples of commonly used in vitro assays for detection of DNA and RNA 
are real-time and end-point polymerase chain reaction (PCR), DNA sequencing, and DNA 
microarray technologies. 

20 [0004] Recently, there has been an increased amount of literature published on detection 
methods for multiple analytes, most of which involves genetic analysis and some relates to 
protein detection. See, e.g., U.S. Patent No. 6,890,741. 

[0005] In a typical nucleic acid detection method used for diagnostic and molecular biology 
research, multiple gene probes complementary with a gene of interest are labeled with small 
25 molecules that can be detected by spectroscopic, electrochemical, biochemical or 



WO 2008/054600 



PCT/US2007/021094 



-2- 

immunochemical means. PCR is generally incorporated for the amplification of targeted gene 
sequences. To achieve detection of multiple analytes, fluorescence-based technologies have 
been used often due to fluorescence dyes readily available. For example, primers have been 
labeled with different fluorescence dyes and the changes in fluorescence were monitored upon 
5 hybridization to their complements (e.g., WO 2002/057479). In other cases, the multiplex 
detection was achieved by using intercalating dyes as labels in DNA restriction fragment 
analysis and capillary electrophoresis with frequency-domain fluorescence lifetime detection 
method (Mcintosh, et al., Electrophoresis, 2002, 23, 1473-1479). Since these methods use pre- 
labeled fluorescence dyes, the detection sensitivity relies largely on the separation of target 

10 bound and unbound fluorescence labeled probes. Though solid phase immobilization of the 
target gene (fluorescence in situ hybridization, for example) can improve the separation 
efficiency by simply washing away the unbound fluorescence labeled probes, this introduces an 
extra process. However, the potential background still can be high, and the procedure can be 
laborious. To address this problem, a non-fluorescence label moiety can be attached to the 

1 5 probes so that the fluorescence signal only occurs after the hybridization event. Recently, the 
development of DN A-programmed chemistry has provided a novel approach for generation of 
fluorescence dye in situ. See , e.g., Li, X.; Liu, D. R. Angew. Chem. Int. Ed. 2004, 43, 4848- 
4870; U.S. Patent No. 7,070,928. 

[0006] Polymethine dye has been widely used as laser dyes, photographic sensitizers and 
20 fluorescence probes due to its superior fluorescence and photochemical properties. However, 
polymethine dyes are generally synthesized by acid/base catalyzed condensation under 
anhydrous conditions which is not comparable to the nucleic acid-templated chemistry 
(Jedrzejewska, et al. Dyes and Pigments 2003, 58, 47-58). Recently, the literature has reported 
an improved aldol condensation in water using Lewis-acid (Kobayashi, et al., J. Am. Chem. 
25 Soc. 1998, 120, 8287-8288) and enamine-based organocatalyst (Mase, et al. J. Am. Chem. Soc. 
2006, 128, 734-735). The quaternary salt of polymethine precursor (active hydrogen 
component) used for condensing with aldehyde, however, is different substantially from the 
precursor (alpha carbon of aldehyde) in a conventional aldol condensation. 

[0007] Thus, there exists a need for new fluorescent and colorimetric technologies that 
30 address many of the shortcomings inherent in the above-mentioned biodetection methods. For 
example, there is a need for methods of polymethine dye synthesis from non-detectable 
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precursors by nucleic acid-templated chemistry and adaptation of such chemistry to 
biodetection. 



SUMMARY OF THE INVENTION 



[0008] The present invention is based, in part, upon the discovery that nucleic acid- 
5 templated chemistry can be applied in detection of multiple biological targets simultaneously. 
The present invention is based, in part, upon the discovery that polymethine dyes can be 
synthesized by nucleic acid-templated chemistry. Assays of this invention may be performed 
in vitro, in situ, or in vivo. 

[0009] In one aspect, the present invention relates to a method for making a polymethine dye 
10 comprising conducting an aldol condensation between an aldehyde and an active hydrogen 
component in an aqueous condition in the presence of an organocatalyst. 

[0010] In some embodiments, the condensation reaction is: 



wherein 

Z' = O, S, Se, P, NH 2 , NR, C(CH 3 ) 2 where R is alkyl group 
n = 0,1,2... 
R = H, alkyl 

R" = H, alkyl, alkyl carboxylic acid 
15 R' = Ph or A/-heterocycle, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , NH 2 , N(R) 2 , OR where R is alkyl grouf 

and wherein the organocatalyst is a secondary amine, a primary amine, a bifunctional 
amine-acid catalyst or a diamine. The secondary amine may be a pyrrolidine, a piperidine, a 
nornicotine, or an analog thereof, for example. The primary amine may be a valine or a peptide 
20 having fewer than 3 amino acid units, for example. The bifunctional amine-acid catalyst may 
be pyrrolidine/ AcOH, for example. The diamine catalyst may be Nl,Nl-dimethylethane-l,2- 
diamine, propane- 1,2-diamine, l-(2-aminoethyl)-piperidine, or an analog thereof, for example. 





R 



Aldehyde 



Active hydrogen component 



Hemicyanine 
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[0011] In another aspect, the invention generally relates to a hemicyanine dye having the 
chemical structure of (I), (II) or (III), for example, prepared by the methods disclosed herein. 



FT' 



(I) 

Z' = O, S, Se, P, NH 2 , NR, C(CH 3 ) 2 where R is alky! group 
n = 0, 1, 2 ... 
R = H, alkyl 

R" = H, alkyl, alkyl carboxylic acid 

R' = Ph or A/-heterocycle, H, alkyl, SQ 3 H, OH, CN, CI, Br, N0 2 , NH 2 , N(R) 2 , OR where R is alkyl group 





' n 2 OH 




n1 = 1,2, 3,4,5 
n2, n3 = 0 to 16 
R1 = alkyl 

R2 = Ph, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , 
NH 2 , N(R 1 ) 2 , OR, while R, is alkyl group 
R 3 = Ph, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , 
NH 2l N(R 1 ) 2l ORi while R, is alkyl group 



n1 = 1,2, 3,4,5 
n2, n3 = 0to 16 
R1 = alkyl 

R2 = Ph, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , 
NH 2l N(Ri) 2 , ORt while R A is alkyl group 
R 3 = Ph, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , 
NH 2 , N(R 1 ) 2 , OR 1 while R, is alkyl group 



10 



(II) (III) 

[0012] In yet another aspect, the invention generally relates to an aldehyde having the 
chemical structure of IV or V: 




n1 » 1,2, 3,4, 5 
n2 = 0 to 16 
R1 = alkyl 

R 2 = Ph, H, alkyl, SO3H, OH, CN, CI, Br, 
N0 2 , NH 2 , N(R 1 ) 2 , OR, while R, is alkyl 
group 

(IV) 




n1 = 1, 2, 3, 4, 5 
n2 = 0to 16 
R1 = alkyl 

R 2 = Ph, H, alkyl, SO3H, OH, CN, CI, Br, 
N0 2 , NH 2 , N(R 1 ) 2 , ORi while R A is alkyl 
group 

(V) 
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[0013] In yet another aspect, the invention generally relates to a quaternary salt having the 
chemical structure of VI or VII: 




n = 0tol6 

R = H, alkyl, S0 3 H, OH, 
CN, CI, Br, N0 2 , NH 2 , 
N(Ri)2. ORt while R-j is 
alkyl group 





n = 0to16 

R = H, alkyl, S0 3 H, OH, 
CN, CI, Br, N0 2 , NH 2 , 
N(Ri)2. ORi while Ri is 
alkyl group 



(VI) (VII) 
[0014] In yet another aspect, the invention generally relates to an quaternary salt-nucleic 
acid conjugate having the chemical structure of: 



R— 



nucleic acid 



n = 0 to 16 

Z1 = O, S, Se, P, NH 2 , NRi, C(CH 3 ) 2 where R A is alkyl group 
R = any substituted benzyl or higher fused benzyl rings, H, 
alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , NH 2> N(R 1 ) 2 . OR, while R, 
is alkyl group 

Z 2 = benzene or any N-heterocycles 
[0015] In yet another aspect, the invention generally relates to an aldehyde -nucleic acid 
conjugate having the chemical structure of: 



R 2 I : 

I ^ n 2 N^ n nucleic acid 



n1 = 1, 2, 3, 4, 5 
n2 = 0to16 
R1 = H, alkyl 

R 2 = Ph or N-heterocycle, H, alkyl, S0 3 H, 
OH, CN, CI, Br, N0 2 . NH 2 , N(R 1 ) 2 , ORi 
while Ri is alkyl group 
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[0016] In yet another aspect, the invention generally relates to a hemicyanine dye-nucleic 
acid conjugate having the chemical structure of: 



[0017] In yet another aspect, the invention generally relates to making a hemicyanine- 
5 nucleic acid conjugate comprising conducting a nucleic acid-templated reaction between an 
aldehyde and quaternary salt disclosed herein to make a hemicyanine disclosed herein. 

[0018] In some embodiments, the nucleic acid-templated reaction is in an end of helix 
architect. In some other embodiments, the nucleic acid-templated reaction is in a middle of 
helix architect. 

1 0 [0019] In yet another aspect, the invention generally relates to a method for selecting a dye 
having a desired fluorescent property. The method includes (a) preparing a library of 
oligonucleotide-encoded dyes through nucleic acid-templated synthesis; (b) hybridizing the 
oligonucleotide-encoded dyes with spatially arrayed complementary oligonucleotide probes 
immobilized on a solid support; (c) measuring the absorption and fluorescence properties of the 

1 5 oligonucleotide-encoded dye directly on the solid support; (d) identifying the oligonucleotides 
that encode the dyes having the desired fluorescence properties based on the position of the 
immobilized complementary oligonucleotide probes, and (e) identifying and characterizing the 
chemical structure of the dyes having the desired fluorescence property. 

[0020] In yet another aspect, the invention generally relates to a method for detecting 
20 multiple target nucleotide sequences. The method includes: (a) providing a number of probe 
pairs, the number equal to the number of target nucleotide sequences, wherein each probe pair 
comprises (1) a first probe comprising (i) a first oligonucleotide sequence and (ii) a first 
reactive group linked to the first oligonucleotide sequence, and (2) a corresponding second 
probe comprising (i) a second oligonucleotide sequence and (ii) a second reactive group linked 
25 to the second oligonucleotide sequence, wherein the first oligonucleotide sequence and the 



nucleic acid N 




nucleic acid 



n1 =1,2, 3,4,5 

n2 and n3 = 0 to 16 

Z1 = O, S, Se, P, NH 2 , NR 1f C(CH 3 ) 2 

where is alkyl group 

R 3 = any substituted benzyl or higher 

fused benzyl rings, H, alkyl, S0 3 H, OH, 

CN, CI, Br, N0 2 , NH 2 , N(R 1 ) 2 , OR] while 

R 1 is alkyl group 

Z 2 = benzene or any N-heterocycles 
R1 = H, alkyl 

R 2 = Ph or N-heterocycle, H, alkyl, S0 3 H, 
OH, CN, CI, Br, N0 2 , NH 2l N(R 1 ) 2 , OR 1 
while Ri is alkyl group 



WO 2008/054600 



PCT/US2007/021094 



-7- 

second oligonucleotide sequence are complementary to two separate regions of a corresponding 
target nucleotide sequence; (b) combining the probe pairs with a sample to be tested for the 
presence of the target nucleotide sequences under conditions where the first probes and the 
second probes hybridize to their respective complementary regions of the target nucleotide 
5 sequences if present in the sample thereby bringing into reactive proximity the first reactive 
groups and the corresponding second reactive groups; and (c) detecting one or more reactions 
between the first reactive groups and the corresponding second reactive groups thereby 
determining the presence of the target nucleotide sequences. 

[0021] The number of target nucleotide sequences may be between about 2 to about 20, for 
10 example, 2 to 6. The target nucleotide sequences may be in solution phase. The target 

nucleotide sequences may be attached to a solid support. In some embodiments, the one or 
more reactions between the first reactive groups and the corresponding second reactive groups 
generate fluorescent compounds that may be detected. In some embodiment, the one or more 
reactions between the first reactive groups and the corresponding second reactive groups 
15 generate chemiluminescent compounds that may be detected. 

[0022] The one or more reactions between the first reactive groups and the corresponding 
second reactive groups may comprise an aldol condensation reaction, for example. The one or 
more reactions between the first reactive groups and the corresponding second reactive groups 
may comprise a Wittig reaction. 

20 [0023] The invention encompasses a kit that provides one, two or more of the probes 

described herein. More particularly, the invention encompasses a kit that provides one, two or 
more of the probes that utilize nucleic acid-templated chemistry for the generation of detectable 
signals as a way for detecting the presence of biological targets. 

[0024] The foregoing aspects and embodiments of the invention may be more fully 
25 understood by reference to the following figures, detailed description and claims. 

DEFINITIONS 

[0025] The term, "DNA programmed chemistry" or "DPC", as used herein, refers to nucleic 
acid-templated chemistry, for example, sequence specific control of chemical reactants to yield 
specific products accomplished by (1) providing one or more templates, which have associated 
30 reactive group(s); (2) contacting one or more transfer groups (reagents) having an anti-codon 
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(e.g. 5 complementary sequence with one or more templates) and reactive group(s) under 
conditions to allow for hybridization to the templates and (3) reaction of the reactive groups to 
yield products. For example, in a one-step nucleic acid-templated reaction, hybridization of a 
"template" and a "complementary" oligonucleotide bring together reactive groups followed by 
5 a chemical reaction that results in the desired product. Structures of the reactants and products 
need not be related to those of the nucleic acids comprising the template and transfer group 
oligonucleotides. See, e.g., U.S. Patent No. 7,070,928 and U.S. Application Publication No. 
2004/0180412 Al, by Liu et al.; Gartner, et al, 2004, Science, vol. 305, pp. 1601-1605; 
Doyon, et al., 2003, JACS, vol. 125, pp. 12372-12373, all of which are expressly incorporated 
10 herein by reference in their entireties. See, also, "Turn Over Probes and Use Thereof by Coull 
et al, PCT International Patent Application PCT/US06/16999, filed on May 3, 2006; U.S. 
patent Application Serial No. 1 1/441,804, "Biodetection by Nucleic Acid-Templated 
Chemistry" by Coull et al., filed on May 26, 2006. 

* 

[0026] The terms, "nucleic acid", "oligonucleotide" (sometimes simply referred to as 
15 "oligo") or "polynucleotide" or as used herein refer to a polymer of nucleotides. The polymer 
may include, without limitation, natural nucleosides (i.e., adenosine, thymidine, guanosine, 
cytidine, uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and deoxycytidine), 
nucleoside analogs (e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolo-pyrimidine, 3- 
methyl adenosine, 5-methylcytidine, C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5- 
20 propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 7-deazaadenosine, 7- 

deazaguanosine, 8-oxoadenosine, 8-oxoguanosine, 0(6)-methylguanine, and 2-thiocytidine), 
chemically modified bases, biologically modified bases (e.g., methylated bases), intercalated 
bases, modified sugars (e.g., 2 -fluororibose, ribose, 2'-deoxyribose, arabinose, and hexose), or 
modified phosphate groups (e.g., phosphorothioates and 5 f -N-phosphoramidite linkages). 
25 Nucleic acids and oligonucleotides may also include other polymers of bases having a modified 
backbone, such as a locked nucleic acid (LNA), a peptide nucleic acid (PNA), a threose nucleic 
acid (TNA). 

[0027] Throughout the description, where compositions are described as having, including, 
or comprising specific components, or where processes are described as having, including, or 
30 comprising specific process steps, it is contemplated that compositions of the present invention 
also consist essentially of, or consist of, the recited components, and that the processes of the 
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present invention also consist essentially of, or consist of, the recited processing steps. Further, 
it should be understood that the order of steps or order for performing certain actions are 
immaterial so long as the invention remains operable. Moreover, two or more steps or actions 
may be conducted simultaneously. 

5 BRIEF DESCRIPTION OF THE FIGURES 

The invention may be further understood from the following figures in which: 

[0028] FIG. 1 illustrates the general chemical structures of polymethine, cyanine and 
hemicyanine dyes. 

[0029] FIG. 2 illustrates the general chemical structures of hemicyanine dyes useful for 
10 multiplex and their aldehyde and quaternary salt precursors. 

[0030] FIG. 3 illustrates the chemical structures of a four-plex hemicyanine-DNA dye 
system and their spectroscopic properties. 

[0031] FIG. 4 is a schematic representation of solution phase-based DPC fluorescence assay 
for multiple analytes. 

15 [0032] FIG. 5 is a schematic representation of solid phase-based DPC fluorescence assay for 
multiple analytes. 

[0033] FIG. 6 is a schematic representation of immunohistochemistry test for multiple 
family receptor dimers in non-zip-coded architecture. 

[0034] FIG. 7 is a schematic representation of immunohistochemistry test for multiple 
20 family receptor dimers in zip-coded architecture. 

[0035] FIG. 8 is a schematic representation of a method for the detection of a biological 
target under one embodiment of the present invention. 

[0036] FIG. 9 is a schematic representation of a method for the detection of a biological 
target under one embodiment of the present invention. 

25 [0037] FIG. 10 shows examples of hybridization as affected by concentration, temperature, 
and the presence or absence of a single base pair mismatch. 
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[0038] FIG, 11 shows exemplary oligonucleotides used in certain melting curve 
experiments. 

[0039] FIG, 12 is a schematic representation of a method for the detection of a biological 
target under one embodiment of the present invention. 

5 [0040] FIG. 13 is a schematic representation of a method for the detection of platelet 
derived growth factor (PDGF) under one embodiment of the present invention. 

[0041] FIG. 14 shows exemplary embodiment of a splinted, zip-coded detection system with 
aptamers as target binding moieties. 

[0042] FIG. 15 shows exemplary embodiment of a splinted, zip-coded detection system with 
10 antibodies as target binding moieties. 

[0043] FIG. 16 shows exemplary embodiment of a splinted, zip-coded detection system with 
antibodies as target binding moieties. 

[0044] FIG. 17 shows absorption and fluorescence emission spectra of DPC reaction 
mixtures (end of helix). 

1 5 [0045] FIG. 18 shows absorption and fluorescence emission spectra of DPC reaction 
mixtures (end of helix). 

[0046] FIG. 19 shows LC-MS data of a crude DPC reaction mixture. 

[0047] FIG. 20 shows LC-MS data of a DPC reaction product. 

[0048] FIG. 21 shows absorption and fluorescence emission spectra of DPC reaction 
20 mixtures. 

[0049] FIG. 22 shows LC-MS data of a DPC reaction product. 

[0050] FIG. 23 shows certain fluorescence intensity data of a DPC reaction. 

[0051] FIG. 24 shows certain fluorescence intensity data of a DPC reaction. 

[0052] FIG. 25 shows fluorescence excitation and emission spectra of four hemicyanine 
25 dyes. 
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[0053] FIG. 26 shows exemplary DNA sequences useful for four-plex hemicyanine dye 
generation. 

[0054] FIG, 27 shows normalized fluorescence emission spectra of four DNA conjugated 
hemicyanine dyes from middle of helix DPC reactions. 

5 [0055] FIG. 28 shows fluorescence kinetic analysis of two DPC reactions in end of helix 
architecture. 

[0056] FIG. 29 shows an example of fluorescence signal generation and biological target 
detection via triphenylphosphine (TPP) and azidocoumarin (AzC) reporter chemistry. 

[0057] FIG. 30 shows an example of fluorescence signal generation and biological target 
1 0 detection via TPP and AzC reporter chemistry. 

[0058] FIG. 31 shows certain examples of melt curves illustrating the effect of 
oligonucleotide concentration on T m . 

[0059] FIG. 32 shows certain examples with DNA hybridization melting curves of 
biotinylated oligonucleotides with and without avidin. 

15 [0060] FIG. 33 shows certain examples of T m changes of complementary biotinylated oligos 
upon binding to avidin. 

[0061] FIG. 34 shows certain examples of the effect of salt and magnesium concentrations 
upon T m of oligonucleotides +/- biotin. 

[0062] FIG. 35 shows certain examples of the melting temperature behavior of biotinylated 
20 oligonucleotides at different ratios of oligonucleotides to avidin. 

[0063] FIG. 36 shows certain examples of melting curves of 5' and 3' (-) biotin-strand 
oligos duplexed with biotin-5' (+) strand oligo in the absence and presence of avidin. 

[0064] FIG. 37 shows certain examples of melting curves of AT-rich biotinylated oligo 
dimers with and without avidin. 

25 [0065] FIG. 38 is a schematic representation of a method for the detection of a biological 
target under one embodiment of the present invention. 
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[0066] FIG. 39 shows examples of experimental results on detection of a biological target 
under one embodiment of the present invention. 

[0067] FIG. 40A and FIG. 40B show examples of experimental results (the effect of 
formamide in the reaction mixture) on detection of a biological target under one embodiment of 
5 the present invention. 

[0068] FIG. 41A and FIG. 41B show examples of experimental results (the effect of 
formamide in the reaction mixture) on detection of a biological target under one embodiment of 
the present invention. 

[0069] FIG. 42 shows examples of experimental results (the effect of formamide in the 
10 reaction mixture) on detection of a biological target under one embodiment of the present 
invention. 

[0070] FIG. 43 shows examples of experimental results (time course of reaction mixtures) 
on detection of a biological target under one embodiment of the present invention. 

[0071] FIG. 44 shows examples of experimental results (time course of reaction mixtures) 
1 5 on detection of a biological target under one embodiment of the present invention. 

[0072] FIG. 45 shows examples of experimental results (probe ratios) on detection of a 
biological target under one embodiment of the present invention. 

[0073] FIG. 46 shows an example of detection of PDGF by a zip-coded detection system. 

[0074] FIG. 47 shows experiments on ratios of aptamers and reporters. 

20 [0075] FIG. 48 illustrates an embodiment of a "one-piece" detection system for the 
detection of PDGF. 

DETAILED DESCRIPTION OF THE INVENTION 

[0076] In its simplest sense, the invention is to detect the presence of target analytes via 
nucleic acid-templated chemistry, for example, through measurement of fluorescence of 
25 polymethine dyes generated by nucleic acid-templated reactions templated by target nucleic 

acids or proteins. The present invention provides methods for analysis of multiple analytes in a 
convenient, accurate and sensitive way. For example, in the detection of nucleic acids, the 
method uses nucleic acid probes conjugated with non-fluorescence precursor (e.g., aldehydes 
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and methyl quaternary salts) and polymethine multiplex dyes are generated through the 
chemical reaction of the probes upon hybridization with target nucleic acids. In addition, the 
invention provides novel chemical compositions of polymethine dyes and methods of 
synthesizing polymethine dyes in conventional reactions under aqueous conditions as well as 
5 via nucleic acid-templated chemistry. 

Polymethine Dye Chemistry 

[0077] Polymethine dye is characterized by a chain of methine (-CH=) groups with an 
electron donor (D) and an electron acceptor (A) at opposite ends of their polyene chain (FIG. 1, 
Zollinger, Color Chemistry: Syntheses, Properties, and Applications of Organic Dyes and 

10 Pigments, 3nd Edn., Verlag Helvetica Chimica Acta, Postfach, Switzerland, 2003). Typical A 
and D terminals for polymethine dyes (as shown in FIG. 1) include thiazoles, pyrroles, 
pyrrolines, indoles, 1, 3, 3-trimethylindolines, tetrazoles, pyrimidine, pyridines, quinolines, and 
higher fused N-heterocycles or any substituted benzyl rings. If the terminals are both jV-atom 
containing heterocycles, the compound is named cyanine. If only one //-atom is part of the ring 

15 system, the compound is named hemicyanine. By changing the number of the vinyl group in 
the polyene chain, the fluorescence emission wavelength of the polymethine dye can be tuned 
from near-UV to near-IR. The terminal group may also provi3e mean for finer tuning. 

[0078] Scheme 1 depicts hemicyanine formation through organocatalytic aldol condensation 
in aqueous buffer. Some of the general organocatalysts such as pyrrolidine analogues have been 
20 listed here. By using catalyst, the reaction condition for the hemicyanine formation can switch 
from anhydrous to aqueous condition. The percentage of the water content used depends only 
on the solubility of the starting materials. 

[0079] Scheme 2 is a schematic illustration of hemicyanine dye generation through DPC in 
the presence of organocatalyst. The general chemical structures of component A 
25 (aldehyde_DNA), component B (quaternary salt bearing active hydrogen component) and 

hemicyaine_DNA conjugate have also been described. The fluorescence emission wavelength 
of the hemicyaine dye can be tuned by changing the number of the vinyl group in the polyene 
chain (n) or by using different substitute groups (R') and terminal groups in component B. 

[0080] FIG. 2 shows the general chemical structures of hemicyanine dyes useful for 
30 multiplex and their aldehyde and quaternary salt precursors. An example of four-plex 
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hemicyanine_DNA dyes derived from the general structure and their maximum UV absorption 
and fluorescence's emission wavelength has been described in FIG, 3. 



Scheme 1: Hemicyanine formation through organocatalytic 
aldol condensation in aqueous buffer. 



aqueous buffer, RT 
organocatalyst 



A: aldehyde 



B: Active hydrogen component 




Hemicyanine 



Z' = O, S, Se, P. NH 2 , NR, C(CH 3 ) 2 where R is alkyt group 
n = 0, 1, 2 ... 
R = H, alkyl 

R" = H, alkyl, alkyl carboxylic acid 

R' = Ph or /V-heterocycle, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , NH 2 , N(R) 2 , OR where R is alkyl group 
Organocatalysts: 

1) Pyrrolidine, piperidine analogues 

2) Amino acid such as valine or small peptide 

3) Bifunctional amine-acid catalysts such as pyrrolidine/AcOH 

4) Diamine such as dimethythe-1 ,2-diamine, propane-1 ,2-diamine 
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Scheme 2: Hemicyanine dye generation through 

organocatalyst catalyzed DPC. 
. DPC coupling 



A' alrlohuHo \ +Jt 




B: Active hydrogen component 



Z' = O. S, Se, P. NH 2 , NR, C(CH 3 ) 2 where R is alkyl group 
n = 0, 1, 2 ... 
R = H, alkyl 

R' = Ph or N-heterocycle, H. alkyl, SO3H, OH, CN, CI, Br. N0 2 , NH 2 , N(R) 2 , OR where R is alkyl group 



5 [0081] FIG, 4 is a schematic illustration of solution phase based DPC fluorescence assay for 
mutiplex detection involving multiple analytes. Each pair is labeled with non-fluorescence 
precursors complementary to a sequence that is indicative of a specific polymorphic site or 
genotype in the diagnostic determination of infection with a virus which can be PCR-amplified 
products. After hybridization, DPC fluorescence product will be formed. By changing the 
1 0 hybridization conditions such as salt and temperature, only matched pairs will generate the 

fluorescence signal. No washing process is needed to remove any non-fluorescence precursors. 
This method is not only useful for analyzing or detecting polynucleotide sequences, it also can 
be used, for example, in an antibody based assay utilizing a nucleic acid conjugated antibody. 

[0082] FIG. 5 is a schematic illustration of a solid phase based DPC fluorescence assay for 
1 5 multiple analytes. Different solid supports can be used to immobilize genes of interest, such as 
a glass plate, a polymer or a gold plate. After hybridization and fluorescence compound 
formation by catalyzed DPC, the solid support can be visualized directly by fluorescence 
microscopy or detected by a fluorescence reader without the need for washing. 

[0083] FIG. 6 illustrates a multiplexed immunohistochemistry (IHC) test for multiple family 
20 receptor dimers. Multiple pairs of probes, each pair being directed at a particular homo- or 

hetero dimer and with a distinct DPC product (e.g., a distinct fluorescent signal from each pair 
as shown), can provide simultaneous detection and profiling of multiple receptor dimers. Both 
zip-coded, as illustrated in FIG. 7, and non-zip-coded probe pairs can be employed in a 

multiplex test. See, e.g., PCT International Application No. PCT/US07/ , titled 

25 "Receptor Family Profiling," by Landsman et aL, filed September 18, 2007 in the U.S. 
Receiving Office. 
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[0084] There are a number of advantages to the detection methods disclosed here. 1) Zero 
background. There is no background fluorescence signal for aldehyde and quaternary salt 
precursors when excited at the polymethine product's excitation wavelength. Since the DPC 
reaction only happens in the presence of the catalyst and the aldehyde and quaternary salt 
5 precursors are very stable, no decomposition of precursors will be observed and thus there is no 
background fluorescence of the decomposed product. 2) Simplicity. The fluorescence 
generation is performed in one-pot and the detection is achieved in situ without isolation of the 
product. 3) Specificity. The fluorescence generation is based on the sequence specific nucleic 
acid interaction, so the signal generation is specific to the nucleic acid sequence. 4) A larger 
10 number of analytes can be detected as compared to conventional methods. The fluorescence 
emission wavelength of polymethine dye can be easily tuned from far UV to near IR, so multi- 
wavelength dyes can be generated in one-pot by utilizing multi-codon DNAs. The numbers of 
analyte that can be detected are not limited by the DPC and dye chemistry. 

[0085] Various and general aspects of nucleic acid-templated chemistry are discussed in 
15 detail below. Additional information may be found in U.S. Patent No. 7,070,928, and 

7,223,545, European Patent No. 1,423,400 Bl and U.S. Application Publication No. 

2004/0180412 Al (USSN 10/643,752) by Liu et al Gartner, et al, 2004, Science, vol. 305, pp. 

1601-1605; Doyon, et al, 2003, JACS, vol. 125, pp. 12372-12373, all of which are expressly 

incorporated herein by reference in their entireties. See, also, "Turn Over Probes and Use 
20 Thereof by Coull et al., PCT WO07/008276A2, filed on May 3, 2006. 

DPC-BASED PROTEIN DETECTION 

[0086] Methods and compositions of biodetection using nucleic acid-templated chemistry 
based probes are described in WO06128138A2 by Coull et al, which is incorporated herein by 
reference in its entirety. 

25 [0087] FIG. 8 and FIG. 9 illustrate one embodiment of the invention for the detection of a 
protein target. 

[0088] FIG. 8 shows an embodiment of detection of a protein target by DPC-based probes. 
Two probes contain target binding moieties, complementary oligonucleotides, and chemically 
reactive species X and Y, respectively. Upon hybridization, X and Y react to create a signal 
30 generating (e.g., fluorescent) compound, which may or may not covalently link both probes. 
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The reaction product of X and Y may also be released as an unbound, soluble compound into 
the solution. The protein target may be attached to a solid-phase such as the surface of a bead, 
glass slide (microarray), etc., or be in solution. The target binding moieties may be aptamers, 
antibodies, antibody fragments (i.e., Fab), receptor proteins, or small molecules, for example. 

5 [0089] More particularly illustrated in FIG. 9 is an example of the dual-probe approach with 
two probes, each carrying a "prefluorophore" precursor (Rl and R2) and containing a binding 
moiety for a target and an oligonucleotide sequence that is designed to anneal to each other. In 
this embodiment, the detection is performed under conditions such that the prefluorophore 
oligos will not anneal to each other in the absence of a target. These conditions are generally 

10 selected such that the ambient temperature is higher than the T m of the oligonucleotide pairs in 
the absence of the target (so that the oligo pairs will not anneal in the absence of the intended 
target analyte). In the presence of the intended target, however, the localized high 
concentration of the oligos then shifts the T m of their double stranded complex upwards so that 
hybridization occurs, which is followed by a signal-generating nucleic acid-templated reaction 

15 (a reaction between Rl and R2). The signal-generating nucleic acid-templated reaction is 

accelerated both due to the localized higher concentration of the prefluorophores, but may also 
be facilitated by the proximity and orientation of the prefluorophore groups towards one 
another. This configuration of signal generation has the potential to enable creation of kits for 
the detection of various biomolecules, cells, surfaces and for the design of in situ assays. The 

20 signal generation does not require enzymes and the homogeneous format requires no sample 
manipulation. 

[0090] In FIG. 9, two oligonucleotides are shown, each of which is linked through an 
optional spacer arm to a separate binder, as shown in this case is an antibody but may be other 
binders such as aptamers or small molecules. Each antibody recognizes a separate epitope on a 

25 common target analyte such as a protein. Spacer arms can be added to one or both 

oligonucleotides between the oligo and the binder. In certain cases, this spacer arm may be 
required to meet proximity requirements to achieve a desired reactivity. Spacer arms in 
principle can be any suitable groups, for example, linear or branched aliphatic carbon chains C3 
to C5, C10, C15, C20, C25, C30, C35, C40, or C100 groups, a DNA sequence of 1 to 10, 15, 

30 20, 30, 50 or 100 bases long, or polyethylene glycol oligomers of the appropriate length. 
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[0091] The prefluorophores may reside in an "end of helix" configuration (FIG. 9 top), one 
attached to the 5' end of an oligo and other to the 3' end. (Other configurations can be applied, 
including placing the two prefluorophores within the sequence or having one oligo hybridize to 
a partial hairpin structure (e.g., 100 Angstroms long), for example.) In the first example, one 
5 oligonucleotide is attached to a 5' to a spacer arm and a target binder, and the other 3' is 
attached to a spacer arm and separate target binder. Spacer arms, which can consist of non- 
complementary DNA sequences, or synthetic spacer arms such as oligomers of ethylene glycol, 
can be added to meet proximity requirements. Such spacer arms can be very flexible, which 
has the advantage of overcoming any steric hindrance to binding that might occur with a rigid 

10 spacer. A suitably long spacer arm design can permit both oligonucleotides to be linked 5' to 
their binders (FIG. 9 bottom), or both linked 3', as long as the oligonucleotides can anneal in 
the antiparallel configuration and allow the reactive groups to react with each other. An 
optimal spacer arm length may be designed for each target. Spacer arms which are excessively 
long should be avoided as they may reduce specificity in the system or a reduced increased T m 

15 effect. 

[0092] The proximity effect afforded by tethering the pair of oligonucleotides may affect the 
kinetics of annealing of two complementary oligonucleotide sequences compared to the two 
oligonucleotides free in solution. More importantly, a localized high concentration shifts the 
melting curve upwards compared to the free complex, i.e. increase the T m of the complex. In a 
20 bulk solution, it is known that T m has dependence upon total oligonucleotide concentration as 
illustrated in the equation below. Wetmur, Critl Rev. in Biochem. And Mol. Biol., 1991, 26, 
227-259. 

T m = (1000* AH) / (A + AS + R ln(C t /4)-273.15 + 16.6 log Na + ) 

where AH and AS are the enthalpy and entropy for helix formation, R is the molar gas constant, 
25 C t is the total concentration of oligomers, and Na + is the molar concentration of sodium ion in 
the solution. 

[0093] FIG. 10 shows that the slope of T m vs. concentration within the range of short 
oligonucleotides in 0.1 M salt has a dependence of about +7° C per 10-fold increase in 
concentration of oligonucleotides (sequences in FIG. 11) based on the above equation. So, for 
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example, a 1000-fold increase in local concentration would be expected to raise T m by about 
+21° C. 

[0094] Reaction products of Rl and R2 may be released from the hybridization complex as a 
result of the chemical transformation. Thus, the fluorophore or chromophore may be separated 
5 from the hybridization complex and analyzed independently, or the fluorophore or 

chromophore and the annealed oligonucleotides may be removed once detected so that 
additional rounds of interrogation of the sample can be conducted. The reaction between Rl 
and R2 may or may not be covalently linked to the two probes once the product(s) is formed. 

[0095] FIG. 12 illustrates another embodiment of the invention, which employs a "zip- 
10 coded" splint architecture for nucleic acid template-based biodetection. In this embodiment, 
instead of the target binding moieties being directly linked (optionally via spacer groups) to the 
complementary oligonucleotides that hybridize and set up nucleic acid templated reactions, the 
target binding moieties is linked to a "zip code" oligonucleotide sequence. Each of the 
corresponding reporter oligonucleotides has a complementary, "anti-zip code" sequence (in 
1 5 addition to a "reporter" sequence that sets up nucleic acid-templated reaction). The nucleic 
acid-templated chemical reactions are set up by the hybridization of the reporter oligos, which 
are linked to reactive groups that react and generate detectable signals. It is important that each 
oligonucleotide sequence of the probes is complementary only to its intended hybridization 
partner and not complementary to other oligonucleotides in the detection system. 

20 [0096] This zip-coded architecture supports creating a single reporter-oligonucleotide 

conjugate which would assemble with different downstream reporter oligonucleotides through 
an anti-zip code sequence. Libraries of different reporters linked to a unique anti-zip code may 
be tested simply by mixing each one with stoicheometric amounts of the binder-zip code 
oligonucleotide conjugate with its complementary zip code. 

25 [0097] FIG. 13 is an illustration of a zip-coded splinted architecture approach where the 
target binding moieties are two aptamers. In this example for detection of platelet derived 
growth factor (PDGF) with illustrative oligo sequences and reporter chemistry (e.g., 
triphenylphosphine, TPP, and 7-azidocoumarin, AzC), the TPP reporter oligonucleotide self- 
assembles to the PDGF aptamer oligonucleotide through hybridization of zip code sequence 

30 (NNN ) to the complementary anti zip code sequence (N'N'N' ) on the TPP reporter 
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oligonucleotide. The reporter oligonucleotide terminates with an exemplary 10-base reporter 
sequence and a 5'-TPP group. A separate pair of oligonucleotides, with different zip codes and 
anti-zip codes (complementary to each other pairwise), also self-assembles to provide the AzC 
reporter sequence and a 3' -AzC group. The AzC oligonucleotides are complementary and 
5 antiparallel to the TPP oligonucleotides so the TPP and AzC groups terminate end-to-end when 
the TPP and AzC oligonucleotides anneal to each other. 

[0098] FIG. 14 illustrates in more detail the zip-coded splinted architecture approach for 
detection of PDGF with illustrative oligo sequences and reporter chemistry (TPP and AzC). 
The TPP pair includes, first, a PDGF-aptamer on the 5 '-end, a CI 8 polyethylene-glycol based 

10 spacer, and an 18-mer zip code sequence. The TPP reporter sequence includes a 

complementary anti-zip code sequence on its 3' terminus, a CI 8 PEG spacer, and a ten base 
pair reporter sequence terminating in a 5' TPP group. The AzC pair of oligonucleotides 
includes a 3'-aptamer linked through a CI 8 PEG spacer to a separate zip code, and a detection 
oligonucleotide linked to a 5' anti-zip code, a CI 8 PEG spacer, and a reporter oligonucleotide 

15 (complementary to the TPP oligonucleotide) terminating in a 3' AzC group. 

[0099] FIG. 15 illustrates an example of the corresponding architect where antibodies are 
used instead of aptamers as target binding moieties. 

[0100] One advantage of the "zip coded" approach is the ability to create the reporter 
oligonucleotides separately, and have them assemble together with binders under conditions 
20 retaining the activities of both the binders and of the nucleic acid template-activated chemistry. 

[0101] The zip-coded system is based upon two pairs of oligonucleotides, with each pair 
being held together by the base-pairing of a unique zip code and an anti-zip code pair. "Zip 
codes" are oligonucleotide sequences which bind specifically to their complementary 
sequences, and preferably are designed such they are not complementary to known genomic 
25 sequences (relevant if the sample may contain genomic DNA), have similar T m values, lack 
significant secondary structure, and do not anneal to other zip code or anti-zip code sequences 
in the detection system. 

[0102] It is worth pointing out the methods of the invention do not require enzymatic or 
chemical ligation of the first and/or the second oligonucleotide sequences. 
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[0103] Factors that may be considered in optimizing a design of a zip-coded architecture 
include, for example, (1) spacer groups (e.g., oligonucleotides and/or non-base groups) between 
the aptamer/antibody and zip codes (spacer 1), e.g., to allow hybridization partners to reach 
each other, to prevent any steric hindrance; (2) Length of a zip code sequence in order to form a 
5 sufficiently stable annealing to the anti-zip code sequence to form the complex; and (3) Spacer 
groups (spacer 2) between the anti-zip code and the reporter sequence, e.g., to prevent any 
steric hindrance. 

[0104] The binders (target binding moieties) attached to the oligonucleotides may be any 
chemical moieties that specifically bind to a target molecule and allow the design of the 

10 invention to work. Examples include a wide range of functionalities, such as (1) antibodies: 
e.g., IgG, IgM, IgA, IgE, Fab's, Fab', F(ab) 2 , Dab, Fv or ScFv fragments; (2) small molecule 
binders, such as inhibitors, drugs, cofactors; (3) receptors for protein detection, and vice versa; 
(4) DNA, RNA, PNA aptamers; (5) DNA sequences for DNA-binding and regulatory proteins; 
(6) peptides representing protein binding motifs; (7) peptides discovered through phage display, 

1 5 random synthesis, mutagenesis; (8) naturally binding protein pairs and complexes; (9) antigens 
(for antibody detection); and (10) a single polyclonal antibody separately attached to two 
oligonucleotides may serve as two separate binders of different specificity. 

[0105] The target binding moieties attached to the oligonucleotides may be of heterogeneous 
types directed against different sites within the same target. For example, the two binders may 
20 be two different antibodies, an antibody and a receptor, an antibody and a small molecule 
binder, a receptor and a peptide, an aptamer and a cofactor, or any other combination. 

[0106] The target analytes can be of any type, provided the target supports two (or more) 
binding sites. Molecules which exist in equilibrium with a monomeric form and a 
homodimeric or higher polymerization phase may be detected by a pair of probes containing 
25 the same binder but different complementary DNA sequences. Suitable targets include 

proteins, cell surfaces, antibodies, antigens, viruses, bacteria, organic surfaces, membranes, 
organelles, in situ analysis of fixed cells, protein complexes. The invention may be particularly 
suited for the detection of fusion proteins (e.g., BCR-ABL in the presence of BCR and ABL). 
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[0107] In the design of the probes, one consideration is the T m of the two reporter sequences 
carrying the reactive groups. Since the T m of the duplex should be below room temperature in 
the absence of a target, this sequence normally should be short, for example 6-15 bases and/or 
A-T rich. A typical reporter length of 1 0 base pairs might have a T m of around 30°C at a low 
5 salt concentration. Therefore, it is often necessary even with a short sequence to add 10% to 
40% volume/volume formamide to further lower the temperature below assay temperature, or 
to elevate the assay temperature. Very short reporter oligonucleotides may suffer from a lack 
of specificity and exhibit some binding to zip code sequences (when these are employed) which 
is undesirable. 

1 0 [0108] Another factor in the design of the probes is the length of oligonucleotide in between 
the binding moiety and the reporter sequence, including any zip code sequences. These must 
be long enough for the reporter oligonucleotides to reach each other and anneal. The sequences 
may be interspersed with polyethylene glycol (PEG) linkers that are flexible and may afford 
additional protection against any steric hindrance. For example, total lengths of 

15 oligonucleotides may be around 35 bases long. Oligonucleotides containing 0, 1, or 2 CI 8 
PEG spacers, or homopolymer tracts may also be utilized (i.e. Cio). 

[0109] A third consideration is the length of zip and anti-zip sequences when these are 
employed (i.e. FIG. 13 and FIG. 16). Aside from the need for each zip code to anneal only to 
its anti-zip code, and not any other zip code, anti-zip code, or reporter sequence, an important 
20 parameter is the T m of the duplex between the zip codes and anti-zip codes. The T m should be 
substantially higher than the highest temperature that will be used in the assay in order that the 
reporter oligonucleotides remain firmly attached to the binding moiety. In practice, zip codes 
of about twice the length of the reporter sequences (i.e. total length of 15-30 bases) are 
desirable and generally meet these criteria. 

25 [0110] Regarding signal generation, nucleic acid-templated chemistry may be used to create 
or destroy a label that effects an optical signal, e.g., creating or destroying a fluorescent, 
chemiluminescent, or colorimetric molecule. Additionally, a detection reaction may be 
designed to create or destroy a product that directly or indirectly creates a detectable label, for 
example, a product that catalyzes a reaction that creates an optical label; inhibits a reaction that 

30 creates an optical label; is a fluorescence quencher; is a fluorescent energy transfer molecule; 
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creates a Ramen label; creates an electrochemiluminescent label (i.e. ruthernium bipyridyl); 
produces an electron spin label molecule. 

[0111] The following examples contain important additional information, exemplification 
and guidance that can be adapted to the practice of this invention in its various embodiments 
5 and equivalents thereof. Practice of the invention will be more fully understood from these 

following examples, which are presented herein for illustrative purpose only, and should not be 
construed as limiting in anyway. 

EXAMPLES 

[0112] Examples 1 to 4 are related to DNA probe preparation. Both the aldehyde and 
10 heterocyclic precursors bearing an active hydrogen component can be conjugated to DNA 

through amide bond formation. First, an acid heterocyclic or aromatic precursor is synthesized. 
The acid is then converted to the active N-hydroxysucciimide ester (NHS ester) that readily 
reacts with DNA bearing amine functionality. 

[0113] Oligonucleotides were prepared using standard phosphoramidite chemistry and 
15 purified by reversed-phase CI 8 column (Glen Research, Sterling VA, USA). Oligonucleotides 
bearing 5 '-amino groups were prepared using 5'-Amino-Modifier 5 and oligonucleotides 
bearing 3'-aminogroups were prepared using 3'-Amino-Modifier C7 CPG (Glen Research, 
Sterling VA, USA). Concentration of the DNA and heterocyclic conjugated DNA was 
determined by UV absorbance at 260 nm. The contribution of the UV absorbance at 260 nm 
20 from the heterocyclic moiety in the heterocyclic conjugated DNA was negligible and was not 
considered. 

Table 2 Oligonucleotide Sequences 

Oligo sequence (5 f -3 f ID No. 

Zip2 TTGGTGCTCGAGTCCCCCCCCCCCCCCCCCCCCCC-NH 2 (SEQ ID NO: 66) 

Zip3 NH2-CCCCCCCCCCCCCCCCCCCCGCTGCCATCGATGGT (SEQ ID NO: 67) 

Zip5 NH2-CCCCCCCCCCCCCCCCCCGTGCCATCCATAGTCAG (SEQ ID NO: 68) 

Antizip2 reported GGACTCGAGCACCAATAC-X-TATAAATTCG-NH2 (SEQ ID NO: 
69) 

Antizip3 reporter NH2-CGAATTTATA-X-CTGACCATCGATGGCAGC (SEQ ID NO: 

70) 

AntizipS reporter mismatch NH2-CCAATTAATA-X-CTGACTATGGATGGCACG (SEQ ID 
NO: 71) 
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AntizipS reporter NH2-CGAATTTATA-X-CTGACTATGGATGGCACG (SEQ ID NO: 

72) 

Antizip2 report2 GGACTCGAGCACCAATACXTATAAATTCGCCC (SEQ ID NO: 

73) 

5 X = Spacer Phosphoramidite 1 8 (Glen Research, Sterling VA, USA) 

EDC 1 GTGGTAGTTGGAGCTGGTGGCGTAGGCAAG (SEQ ID NO: 74) 

EDC2 H2N-AGCTC CAACTACCAC (SEQ ID NO: 75) 

EDC3 H2N -AGATCCCACTAGCAC (SEQ ID NO: 76) 

EDC4 GTGGTAGTT GGAGCT-NH2 (SEQ ID NO: 77) 

1 0 EDC5 TCTTGCCTACGCCAC -NH2 (SEQ ID NO: 78) 

EDC7 NH2 -ACCCTTGAACACGTC (SEQ ID NO: 79) 

EDC8 TCTCCGTTGCCGCTC-NH2 (SEQ ID NO: 80) 

EDC 10 GTGGTAGTTGGAGCTGGAGCGGCAACGGAGA (SEQ ID NO: 81) 

EDC 1 1 GACGTGTTC AAGGGTGGTGGCGTAGGC AAGA (SEQ ID NO: 82) 

1 5 EDC 1 2 GACGTGTTC AAGGGTGGAGCGGCAACGGAGA (SEQ ID NO: 83) 



Example 1 DNA conjugated quaternary salt 

[0114] Scheme 3 gives one example of synthesizing DNA conjugated quaternary salt bearing 
active hydrogen component (indoliniumDNA). 2,3,3-trimethylindolenine is commercially 
20 available. The acid functionality is introduced to the indoline ring through N-quaternization. 



Scheme 3: Example of synthesizing DNA conjugated quaternary 
salt bearing active hydrogen component (indolinium_DNA). 




35% yield 2 |r 

lndolinium_DNA 



25 [0115] Synthesis of compound 1: To 5 -bromo valeric acid (2.435 g, 13.45 mmole) was added 
2,3,3-trimethylindolenine (2.141 g, 13.45 mmole). The reaction mixture was heated with 
rigorous stirring at 1 10 °C overnight. The dark red sticky oil obtained was transfered to a 
Gregar extractor and extracted with EtOAc overnight. A light red solid was obtained. The solid 
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was redissolved in 30 mL of MeOH. MeOH was removed under reduced pressure and the 
remaining residue was treated with 10 mL of EtOAc. Browish solid was precipitated out and 
filtrated. The solid was washed with 2 x 50 mL of acetone and 2 x 100 mL of EtOAc. Total 
1.590 g of light brownish solid was obtained (35% yield). ! H NMR (DMSO) 8 ppm : 7.98 (m, 
5 1H), 7.84 (m, 1H), 7.61 (m, 2H), 4.49 (t, 2H), 2.84 (s, 3H), 2.30 (t, 2H), 1.84 (m, 2H), 1.63 (m, 
2H), 1.53 (s, 6H). MALDI-MS (positive mode): 260.2419. 

[0116] Synthesis of compound 2: Compound 1 (0.1 g, 0.294 mmole), N-hydroxy succimide 
(0.068 g, 0.588 mmole) and N, yV'-dicyclohexylcarbodiimide (DCC) (0.085 g, 0.41 1 mmole) 
were dissolved in 1.5 mL of dimethyl formamide (DMF). The reaction mixture was stirred at 
10 37 °C for 1 hr. The precipitated dicyclohexylurea (DCU) was removed by filtration, and the 
filtrate was treated with 1 5 mL of ether. Light orange solid was washed three times with 1 0 
mL of ether and dried under vacuum for several hours. The solid obtained was used directly for 
the next reaction. MALDI-MS (positive mode): 357.1590. 

[0117] Labeling DNA with indolinium compound: To a 1 .5 mL of centrifugation vial 

1 5 containing 20 nmole of DNA was added 4 1 .6 uL of 0. 1 M sodium phosphate buffer (NaPi), pH 
7.8, 41.6 uL of compound 2 in jV-methyl 2-pyrrolidone (NMP) (96 mM) and 41.6 uL of NMP. 
The vial was placed in a shaker and shaked for 4 hr at 37 °C. The reaction mixture was desalted 
by gel filtration using Sephadex G-25 and then purified by reversed-phase C8 column. 
Indolinium_EDC2 (DNA: SEQ ID NO: 75): 15% yield. LC-MS (negative mode): Calcd for 

20 Ci 63 H2i2N5709oPi5 (monoisotopic): 1216.7379 [M-5H]" 4 ; Found: 1216.9552 [M-5H] 4 "; 

Indolinium_EDC4 (DNA: SEQ ID NO: 77): 22% yield. LC-MS (negative mode): Calcd for 
Ci72H 2 2iN 60 O96Pi5(monoisotpic): 1280.5002 [M-5H]" 4 ; Found: 1280.7356 [M-5H]" 4 ; 
Indolinium_EDC5 (DNA: SEQ ID NO: 78): 15% yield. LC-MS (negative mode): Calcd for 
C166H220N51O95P15 (monoisotpic): 1226.7426 [M-5H] -4 ; Found: 1226.9657 [M-5H]- 4 . 

25 Indolinium_antizip5 (DNA: SEQ ID NO: 72): 10% yield. LC-MS (negative mode): Calcd for 
C 3 o7H4oiN 10 80i8oP29 (average): 1340.2614 [M-8H] 7 "; Found: 1340.2705 [M-8H] 7 ". Indolinium_ 
antizip5m (DNA: SEQ ID NO 71): 10% yield. LC-MS (negative mode): Calcd for 
C308H404N109O177P29 (monoisotopic): 1336.4969 [M-8H]7-; Found: 1336.673 [M-8H] 7- 
Indolinium_ antizip3 (DNA: SEQ ID NO: 70): 5% yield. LC-MS (negative mode): Calcd for 

30 C307H404N107O178P29 (monoisotopic): 1332.4034 [M-8H]7-; Found: 1332.6293 [M-8H]7-. 
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Example 2 DNA conjugated quaternary salt 

[0118] Scheme 4 provides another example of synthesizing DNA conjugated quaternary salt 
bearing active hydrogen component (benzoindoliniumDNA) following the similar route as 
indolinium DNA. 



[0119] Synthesis of compound 3: (same procedure of synthesizing compound 1): 1,1,2- 
10 trimethy-1 H-benzoindole (2.73 g, 13 mmole) and 5 -bromo valeric acid (2.36 g, 13 mmole) was 
heated with rigorous stirring at 1 10 °C overnight. Total 3.016 g of 4 was obtained after working 
up as off-white solid (59% yield). ] H NMR (CD 3 OD) 6 ppm : 8.30 (m), 8.15 (m), 8.05 (d), 7.7 
(m), 4.66 (t, 2H), 2.45 (t, 2H), 2.1 (m, 2H), 1.85 (m, 2H), 1.85 (s, 3H), 1.83 (s, 6H). MALDI- 
MS (positive mode): 310.209. 

15 [0120] Synthesis of compound 4: Compound 4 was synthesized following the same 

procedure of synthesizing compound 2 and was used directly for labeling DNA after ether 
precipitation. 

[0121] Labeling DNA with benzoindolinium compound: Following the same procedure of 
synthesizing indoliniumJDNA, total 1 1.2 nmole of Benzoindolinium_EDC7 (DNA: SEQ ID 
20 NO: 79) was obtained starting from 50 nmole of EDC7: 22% yield. LC-MS (negative mode): 
Calculated for Ci68H 2 i6N560 9 2Pi5 + (monoisotopic): 1650.0003 [M-4H]" 3 ; Found: 1650.0359 
[M-4H]3". 

Example 3 DNA conjugated aldehyde 

[0122] Scheme 5 provides one example of synthesizing DNA conjugated aldehyde. The acid 
25 functionality in aldehyde precursor is introduced through hydrolysis of a cyano group by 
hydrogen peroxide (Brady, J. D.; Robins, S. P. J. Bio. Chem. 2001, 276, 18812-18818.). 



5 



Scheme 4: Example of synthesizing DNA conjugated quaternary salt 
bearing active hydrogen component (benzoindolinium_DNA). 




X | DNA | 
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Scheme 5: Example of synthesizing DNA conjugated aldehyde (A0_DNA). 

CHO O ? HO 

/=r° rS Cn-oh ( 



y V 5 M NaOH, 6% H 2 0 2 dna 

— N reflux,2 hr ^ > DCC.D MF NaPi, pH B.S. NMP ^ \ 

\ CN XOOH 37 ° C - 1 hr overnight at 37 °C \* 

A/-methy1-A/- r ^ HN^D^ 

cyanoethyt-4- 5 

amino benz aldehyde 62% yield 

Aldehyde_DNA 



[0123] Synthesis of compound 5: In a 50 mL of round-shaped flask containing N-methyl-N- 
cyanoethyl-4-aminobenzaldehyde (1.024 g, 5.44 mmole) was added 27.2 mL of 5 N NaOH 
5 solution and 6.8 mL of 30% H2O2. The reaction mixture was refluxed for 2 hr. After cooling 
down, the reaction mixture was neutralized by the addition of concentrated HC1 (37% w.t.) and 
extracted with 2 x 100 mL of EtOAc and 1 x 100 mL of CH2CI2. The organic layers were 
combined and washed once with 50 mL of brine and concentrated to dryness. The crude 
product was purified by a 40 g RediSep silica-gel column on a CombiFlash Companion 
10 chromatography system (EtOAc/MeOH). Total 0.702 g of light pinkish solid was obtained 
(62%). Electrospray MS: M+H 208.0735. (Brady, et al„ J. Biol Chem. 2001, 276, 18812- 
18818). 

[0124] Labeling DNA with aldehyde: The NHS ester of 5 was synthesized following the 
same procedure of compound 2. After removing the DCU by filtration, the filtrate was used 

15 directly for DNA conjugation (calculated as 0.2 M product in DMF). To a 1.5 mL of 

centrifugation vial containing 50 nmole of DNA was added 104 jjJL of 0.1 M NaPi, pH 8.6, 125 
|4,L of the above filtrate and 83 j^L of NMP. The vial was placed in a shaker and shaked 
overnight at 37 °C. The reaction mixture was desalted by gel filtration using Sephadex G-25 
and then purified by reversed-phase C8 column. Aldehyde_EDC2 (DNA: SEQ ID NO: 75) 

20 (44% yield). LC-MS: Calcd for C^I^NstO^Pis (monoisotopic): 1203.9710 [M-4H] 4 -; 
1605.6306 [M-3H] 3 ~ Found: 1203.9664 [M-4H] 4 "; 1605.6305 [M-3H] 3 '; Aldehyde_EDC3 
(DNA: SEQ ID NO: 76): (49% yield). LC-MS: Calcd for C159H204N59O91P15 (monoisotopic): 
1213.9725 [M-4H] 4 "; 1618.9660 [M-3H] 3 " Found: 1213.9620 [M-4H] 4- ; 1618.9590 [M-3H] 3 . 
Aldehyde_ antizip2 reporterl (DNA: SEQ ID NO: 69) (30% yield). LC-MS: Calcd for 

25 C303H396N110O177P29 (monoisotopic): 1328.2458 [M-7H]7"; Found: 1328.3051 [M-7H] 7 \ 
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Example 4 DNA labeled a,p-unsaturated aldehyde 

[0125] Scheme 6 provides an example of synthesizing DNA labeled a,P-unsaturated 
aldehyde 1 . Wittig reagent was used for the two-carbon homologation of aldehydes into the 
corresponding a, (3-enals (Eitel, M.; Pindur, U. Synthesis 1989, 364-367. The acid fiinctionality 
in aldehyde precursor is introduced through hydrolysis of a cyano group by concentrated HC1 
(Bratenko, M. K.; Chornous, V. A.; Vovk, M. V. Chemistry of Heterocyclic Compounds 2004, 
40, 1279-1282). 

Scheme 6: Example of synthesizing DNA conjugated aldehyde (A1_DNA). 



^ CHO 

C° I'M pi ^ N " 0H 

\J Ph3P=CHCHO ConcH CI O 




— N ™.™ ~ ZZ~Z> ' >i 1N- ^ DCC.DMF NaPi.p H 8.6, NMP 
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^ Toluene, reflux, overnight |^ 1 " \ 

CN CN k C0QH 

A/-methyt-rV- 6 _ 

cyanoethyM- 

aminobenzaldehyde 21 %yieid 100% 




Aldehyde1_DNA 



10 [0126] Synthesis of compound 6: In a 100 mL of round-shaped flask containing iV-methyl-N- 
cyanoethyl-4-aminobenzaldehyde (1.116 g, 5.9 mmole) and ylide (2.71 g, 8.9 mmole) was 
added 57 mL of dry toluene. The reaction mixture is heated under reflux for overnight, allowed 
to cool, and filtered through filter paper. After removing the solvent from the filtrate, the 
residue was first purified by a 40 g RediSep silica-gel column on a CombiFlash Companion 

15 chromatography system (Toluene/Ether) and then Preparative HPLC CI 8 column (Agilent 

Prep-C18, 30 x 100 mm, 10 urn) to afford 0.27 g of pure product (21%). MALDI-MS (positive 
mode): 215.226. 

[0127] Synthesis of compound 7: In a 50 mL of round-shpaed flask containing compound 6 
(0.1 g, 0.47 mmole) was added 30 mL of concentrated HC1. The reaction mixture was heating 
20 to boiling and left at room temperature (RT) for 1 hr. HPLC analysis indicated that only one 
product was formed and no starting material remained in the reaction mixture. After removing 
most of the HC1, the compound was dissolved in water and lyopholyzed to dryness to afford the 
product. 
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[0128] Labeling DNA with a,p-unsaturated aldehyde: The NHS ester of compound 7 was 
synthesized following the same procedure as compound 2, however was purified by silica-gel 
chromatography (EtOAc/Hexanes) instead. After drying under vacuum for several hours, the 
NHS ester of compound 7 was dissolved in NMP (96 raM) and was used to label DNA 
5 following the same procedure as labeling aldehyde_DNA. Aldehyde 1 JEDC8 (DNA: SEQ ID 
NO: 80): yield 46%. Calcd for C163H215N48O99P15 (monoisotopic): 1629.9698 [M-3H]" 3 ; Found: 
1629.9995 [M-3H]3". Aldehydel_ antizip2 reporterl (DNA: SEQ ID NO: 69): yield 40 %. 
Calcd for C 3 o5H398N,,oOi7 7 P29 (monoisotopic): 1331.96239 [M-7H]" 7 ; Found: 1332.0778 [M- 
7H] -7 . 

10 [0129] Example 5 to 8 are related to the preparation of indole and indolinium analogues for 
DNA conjugation. 

[0130] Indole analogues can be synthesized following the general Fischer-indole synthesis by 
converting aryl hydrazones to indoles under acidic conditions (Scheme 7). First, a primary 
aromatic amine and nitrous acid reacts to give a diazonium salt. The diazonium salt is then 
15 reduced to a hydrazine (Hunsberger et. al. J. Org. Chem. 1956, 27, 394—399). Finally hydrazine 
reacts with 3-methylbutan-2-one to form the aryl hydrazone which upon isomerization and 
elimination of NH 3 forms indole (Lindsey et. al. Tetrahedron 1989, 45, 4845-4866). 

Scheme 7: General synthetic routes to the indole and indolinium analogues. 




OH 

2,3,3-trimethyl-3H- 

benzo[g]indole-5- 3-(4-carboxy buty l)-1 , 1 ,2-trimethy I- 

sulfonic acid 1H-benzo[e]indolium-7-sulfonate 
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Example 5: Synthesis of 5-methoxy-2,33-trimethyl-3H-indole 

[0131] To a solution of 4-methoxyaniline (2.46 g, 20 mmol) in 60 mL of cone. HC1 was 
added dropwise the solution of NaN0 2 (1.38 g, 20 mmol) in 35 mL of H 2 0 at 0 °C. After 
stirring for 0.5 h at 0 °C, the reaction mixture was added dropwise to a solution of SnCl 2 (9.03 
5 g, 40 mmol) in 35 mL of cone. HC1 at 0 °C, then stirring was continued for 1.5 h at 0 °C. 2N 
NaOH was then added to quench the reaction until pH = 9 to 10. The aqueous layer was 
extracted with DCM (50 mL x 3) and the organic layer was dried over Na 2 S0 4 . After filtration 
and concentration, the desired product, (4-methoxyphenyl)hydrazine was obtained (0.98 g, 35 
% yield), which was used directly for next step. 

10 [0132] A mixture of (4-methoxyphenyl)hydrazine (0.98 g, 7.1 mmol) and 3-methylbutan-2- 
one (1.53 g, 17.8 mmol) in 20 mL of HO Ac was heated at 100 °C overnight. The mixture was 
concentrated and IN NaOH was added until pH 9 to 10. The aqueous solution was extracted 
with ethyl acetate (50 mL x 3). The combined organic layers were dried over Na 2 S04. After 
filtration and concentration, the residue was purified by flash column chromatography to give 

15 610 mg of 5-methoxy-2,3,3-trimethyl-3H-indole (45 %). *H NMR (CDC1 3 ) 6 ppm : 7.43 (dd, 1H), 
6.81 (m, 2H), 3.83 (s, 3H), 2.26 (s, 3H), 1.28 (s, 6H). LC-MS (M+H): 190.16 

Example 6: Synthesis of 4(6)-nitro-2,3,3-trimethyl-3H-indole 

[0133] (3-nitrophenyl)hydrazine (MW=189.6, 685 mg, 3.6 mmol) and 0.75 ml 3-methyl-2- 
butanone were stirred in 8 mL of EtOH at RT for 10 minutes, then at 40 °C for 15 minutes. 

20 Ethanol was removed under reduced pressure. The residue was taken up in 20 mL of cone. HC1 
and heated at 100 °C for 2h. The aq. HC1 was then removed under reduced pressure. The solid 
was triturated with 2 mL of iced water, filtered and washed with 2 mL of iced water. After 
drying in the air, the solid weighed 300 mg. However, TLC indicated that it contained a 
mixture of two products. The aqueous layers were neutralized with IN NaOH to pH~8.0, 

25 extracted with EtOAc (50 mL x 3). The organic layers and the solid obtained previously were 
combined, dried over Na 2 S04 overnight. The organic solution was filtered, washed with 
EtOAc, and concentrated under reduced pressure. The residue was purified by flash 
chromatography (Si0 2 , 70 g), eluted with 8-15% EtOAc in hexane to give two fractions. The 
first fraction was obtained in 217 mg (30% yield) as a light yellow oil, which was identified as 

30 compound 4-nitro-2,3,3-trimethyl-3H-indole by 1H NMR. [ H NMR (CDC1 3 ) 5 ppm : 8.0 (dd, 
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1H), 7.89 (dd, 1H), 7.50 (t, 1H), 2.73 (s, 3H), 1.50 (s, 6H). The second fraction was obtained in 
245 mg (33% yield) as a yellow solid, which was identified as 6-nitro-2,3,3-trimethyl-3H- 
inodle by 1H NMR. *H NMR (CDC1 3 ) 6 ppm : 8.33 (d, 1H), 8.12 (dd, 1H), 7.50 (t, 1H), 7.38 (d, 
1H), 2.33 (s, 3H), 1.27 (s, 6H). 

5 Example 7: Synthesis of 2,3?3-trimethyl-3H-benzo[g]indole-5-sulfonic acid. 

[0134] A solution of sodium 4-aminonaphthalene-l -sulfonate (2.45 g, 10 mmol) in H 2 0 (15 
mL) was added a solution of NaN0 2 (0.70 g, 10 mmol) in H 2 0 (2 mL) at 10-15 °C. The 
solution was then added to a cold solution of cone. H2SO4 (0.54 g, 5.5 mmol) in 0.5 mL of 
H 2 0. The temperature was maintained below 10 oC and the mixture was stirred for 1 .5 h after 

1 0 the addition was complete. The mixture was then added dropwise to a cold solution of SnC12 
(3.8g, 17 mmol) in 2.5 mL of cone. HC1 and 1.5 ml of H 2 0. The reaction temperature was kept 
below 10 °C and allowed to sit onernoght. It was filtered and washed thoroughly with H 2 0. 
The cake was removed twice from the Buchner funnel and suspended in H 2 0 and filtered. The 
solid obtained was dried under vacuum to give 1.4 g of 4-hydrazinylnaphthalene-l -sulfonic 

15 acid (59%). The material was used directly for next step. A solution of 4- 

hydrazinylnaphthalene-1 -sulfonic acid (0.7g, 2.9 mmol) in 3 ml of HOAc was added 3- 
methylbutan-2-one (0.5 ml, 1.5 eq), NaOAc (0.47g, 2.0 eq.). The mixture was stirred at 1 10 °C 
for 3.5h. After cooling and addition of ether, the precipitate was filtered to give 700 mg. The 
solid was dissolved in DCM and purified by flash chromatography on Si0 2 using 10% MeOH 

20 in DCM to give 273 mg of product. The mother liquor was concentrated and was also purified 
by flash chromatography on silica gel to give additional 372 mg of product. The overall yield 
was 77%. ! H NMR (DMSO) 5 ppm : 8.8 (dd, 1H), 8.4 (dd, 1H), 8.0 (s, 1H), 7.5 (m, 2H), 3.1 (s, 
5H), 2.3 (s, 4H), 1.3 (s, 6H). LC-MS: 288.2 [M-H]. 

Example 8: Synthesis of 3-(4-carboxybutyl)-l,l52-trimethyl-lH-benzo[e]indolium-7- 
25 sulfonate. 

[0135] To a solution of 6-aminonaphthalene-2-sulfonic acid (2.23 g, 10 mmol) in 15 ml of 
H 2 0 was added 2N NaOH (0.75 ml). The mixture was stirred for 5 min at RT and cone. H 2 SC>4 
(0.68 g, 6.9 mmol) was added to it dropwise at 0 °C. 2 ml of iced H20 was added, followed by 
a solution of NaN0 2 (1.04 g, 15 mmol) in 2 ml of H 2 0. After the solution was stirred at 0 °C 
30 for 2h, the diazonium salt was removed by filtration and washed with cold water. The salt was 
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then added in portions to a cold (<0 °C) solution of SnC12 (4.9g, 22 mmol) in 3.2 ml of cone. 
HC1 and 1.8 ml of H2O. The mixture was stirred overnight. The solid was filtered and washed 
twice with water, dried under vacuum to give 1.84 g of 6-hydrazinylnaphthalene-2-sulfonic 
acid (77%), which was used directly for next step. 

5 [0136] To a solution of 6-hydrazinylnaphthalene-2-sulfonic acid (0.7 g, 2.9 mmol) in 3 ml of 
HOAc was added 3-methylbutan-2-one (0.5 ml, 1.5 eq), NaOAc (0.47g, 2.0 eq.). The mixture 
was stirred at 110 °C for 3.5h. After cooling, the solvent was removed under reduced pressure. 
The residue was dissolved in MeOH and 3.0g of Si0 2 was added. MeOH was removed and the 
silica gel was loaded on a silica gel column and eluted with 10% MeOH in DCM to give the 
10 desired product, l,l,2-trimethyl-lH-benzo[e]indole-7-sulfonic acid (874 mg, >95% yield). LC- 
MS: 288.2 [M-H] 

[0137] To a solution of 1, 1, 2-trimethyl-lH-benzo[e]indole-7-sulfonic acid (723mg, 2.5 
mmol) in methanol (5 mL) was added a saturated solution of potassium hydroxide in 
isopropanol (3.134%, 4.9 13g, 1.1 eq.) and the resulting suspension was heated at reflux for 2h. 

1 5 Then the mixture was concentrated and the corresponding potassium salt was obtained. Under 
nitrogen atmosphere, a mixture of potassium 1,1, 2-trimethyl-lH-benzo[e]indole-7-sulfonate 
and 5- bromo valeric acid (585mg, 3 mmol) in 3-methyl-2-butanone (5 mL) was heated at 140 
°C for 20h. Removal of the solvent and purification by chromatography (dichloromethane / 
methanol) afforded 3-(4-carboxybutyl)-l,l,2-trimethyl-lH-benzo[e]indolium-7-sulfonate (70 

20 mg, isolated yield 7.2%). LC-MS: 390.18 [M+]. 

Example 9 Organocatalyzed hemicyanine synthesis in aqueous buffer. 

[0138] Scheme 8 provides an example of synthesizing hemicyaine 8 in aqueous buffer in the 
presence of various catalysts. The extent of hemicyanine formation was easily monitored by 
analytical reversed-phase HPLC (UV at 545 nm). Hemicyanine 8 has fluorescence excitation 
25 wavelength maximum at 535 nm and emission maximum at 580 nm. MALDI-MS analysis of 
the product confirms the structure (M+: 449.1992). The experimental data indicates (S)- 
pyrrolidinemethylpyrrolidine ((S)-PMP) has better catalytical ability than other catalysts. 
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Scheme 8: Hemicyanine formation in aqueous buffer in the presence of catalysts 
Q 
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Catalysts: 

(1) L-valine (0.3 equiv. water); (2) L-proline (0.3 equiv. water); 
3) (S)-FVrrolidinerTiethyIpyrrolia5ne (0.3 equiv. 50 mM sodium phsphate buffer; pH 8.5); 

(4) (S)-2-pyrrofidinemethanol (0.3 equiv. watr) ; (5) Zino-proline orZn(Pro)2 (0.1 equiv. 20 M sodium phoshate buffer, pH 8.5); 
(6) Pyrroiidine/aceticadd (0.4/0.2 equv. waer) ; (7) F^rrolidne/iacanphcfsulfonicacid (0.4/0.2, watr) 



Reaction condions: 25 mM each of the starting material, 20% NMP in water or sodium phosphate buffer plus required catalysts, RT for 16 hr. 



Examples 10-13 are related to DPC of hemicyanine formation. 
Example 10 DPC hemicyanine formation (end of helix architect) 

[0139] Scheme 9 gives an example of DPC hemicyanine formation through end of helix 
architecture. Upon annealing, the two hemicyanine precursors were placed in reactive 
proximity at the end of helix and a hemicyanine linked to both DNA was formed after 
condensation. 



Scheme 9: Example of synthesizing hemicyanine DNA dye 1 
through DPC (end of helix). 




EDC3 H2N-AGATC CCACT AGCAC (mismatch) (S)-1-(2-pyrrolidnylmeth y l)-pyrrolidine (PMP) 

(SEQ ID NOs: 75-77) 
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[0140] The progress of DPC reaction was monitored by fluorescence spectroscopy. FIG. 17 
shows the fluorescence emission of DPC reaction mixture of indolinium and aldehyde DNA 
(I_EDC2 and A_EDC4) (DNA: SEQ ID NO: 75; SEQ ID NO: 77)at various conditions. First, 
there is no background fluorescence emission for hemicyanine precursor I_EDC2 (DNA: SEQ 
5 ID NO: 75) and AJEDC4 alone in the reaction buffer (2 & 3 in FIG. 17). Second, the DPC 
reaction is catalyst dependent and not pH dependent. Without the addition of (S)-PMP, there is 
no fluorescence signal. Simply increasing the pH of the reaction conditions from pH 8.4 to 10.0 
did not generate any signal (5 & 6 in FIG. 17). Finally, DPC reaction is Watson-Crick 
sequence-specific (nucleic acid dependent). By switching EDC4 with three mismatched 

10 nucleobases DNA (EDC3, DNA: SEQ ID NO: 76), no fluorescence signal was generated after 
140 minutes. Only trace amount of fluorescence signal detected after 16 hr (FIG. 18). After 16 
hr, LC-MS shows almost 90% conversion of the product for reaction 1 (FIG. 19). FIG. 20 
gives the electrospray mass data of purified DPC product (EDC4_H_EDC2) (DNA: SEQ ID 
NO: 77; SEQ ID NO: 75) which confirms the structure. The relative quantum yield measured 

15 for EDC2_H_EDC4 (DNA: SEQ ID NO: 75; SEQ ID NO: 77) is around 0.1 in water using 
sulforhodamine 101 as fluorescence standard. The extinction coefficiency of EDC2_H_EDC4 
(DNA: SEQ ID NO: 75; SEQ ID NO: 77) at 550 nm is around 87000 in water. 

[0141] DPC reaction: Reactions were performed with 200 nM each of reagent in 10 mM (S)- 
PMP, 50 mM sodium phosphate buffer, pH 8.4, 1 M NaCl at RT unless otherwise specified. 
20 Catalyst (S)-PMP was added after mixing both reagents together in reaction buffer. 

Example 11 DPC hemicyanine formation (middle of helix architect) 

[0142] Scheme 10 provides another example of DPC hemicyanine formation through middle 
of helix architecture where the reactants were labeled to two probes which can complementary 
with a single template. Upon annealing, the two hemicyanine precursors were placed in 

25 reactive proximity at the middle of helix and a hemicyanine linked to both DNA was formed. 
The experimental data indicate only in the presence of the template, fluorescence signal is 
generated (FIG. 21). EDC2_H_EDC5 (DNA: SEQ ID NO: 75; SEQ ID NO: 78) was purified 
and its structure was confirmed by mass data (FIG. 22). The relative quantum yield measured 
for EDC2_H_EDC5 (DNA: SEQ ID NO: 75; SEQ ID NO: 78) alone is similar to 

30 EDC2_H_EDC4 (DNA: SEQ ID NO: 75; SEQ ID NO: 77) (0.1 in water). The extinction 
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coefficiency of EDC2_H_EDC5 (DNA: SEQ ID NO: 75; SEQ ID NO: 78) at 550 nm is around 
75000 in water. 



Scheme 10: Example of synthesizing hemicyanine_DNA dye 2 
through DPC (middle of helix). 

o 




atdehyde_EDC5 



Name Sequence (5' - 3') 

EDC1 GTGGT AGTTG GAGCT GGTGG CGTAG GCAAG A 
EDC2 H2N-AGCTC CAACT ACCAC 
5 EDC5 TCTTG CCTAC GCCAC-NH2 

(SEQ ID NOs: 74, 75, and 78) 

[0143] DPC reaction: Reactions were performed with 200 nM each of indolinium_EDC5 
(DNA: SEQ ID NO: 78), aldehyde_EDC5 (DNA: SEQ ID NO: 78), EDC1 (DNA: SEQ ID NO: 
10 74) in 10 mM (R)-PMP, 50 mM sodium phosphate buffer, pH 8.4, 1M NaCl at RT. (S)-PMP 
was added after mixing both reagents and template together in the reaction buffer. 

Example 12 Kinetic studies of hemicyanine DPC 

[0144] The reaction rate for both end-of-helix (Example 10) and middle-of-helix (Example 
11) DPC were investigated by fluorescence spectroscopy (FIG. 23 and FIG. 24). The kinetic 
15 profiles for both reactions are very similar. After around 16 hr, the fluorescence signal reached 
plateau which indicated the completion of the reaction. 

Example 13 Generation of four fluorescent hemicyanine dyes via middle-of-helix DPC. 
[0145] FIG. 3 gives an example of four-plex hemicyanine DNA dyes that can be generated 
through DPC. In this example, total two quaternary salt precursors (indolinium I and 
20 benzoindolinium BI) and two aldehydes (AO, Al) are needed to generate four DNA 

hemicyanine dyes. These precursors share the structural similarity, thus DPC conditions for 
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quaternary salt and aldehyde should be similar. According to literature data, benzoindolinium 
compound generally gives 20 nm of red shift while one extra double bond conjugation shifts 
the fluorescence emission wavelength towards to visible range (~ 80 nm for hemicyanine dye). 
FIG. 3 lists the spectroscopic properties of these four hemicyaine_DNA dyes. 

5 [0146] Small molecule hemicyanine dyes (9 to 12) were first synthesized and their 

fluorescence excitation and emission spectra were recorded (FIG. 25). As predicted, one extra 
phenyl substitution shifted the fluorescence emission 20 nm to the red (compound 9 and 10), 
while each additional vinyl group in the polyene chain shifted around 80 nm (9 and 11; 10 and 
12). To maximize the utility of the DNA codons and simplify the DNA probe preparation, only 

10 two DNA strands was used to label I and BI and two DNA strands for aldehydes with four 

template strands (FIG. 26). Each template had two unique codons that can only Watson-Crick 
base pair with one set of aldehyde and quaternary salt precursor to generate one hemicyanine 
dye. The DPC was performed in a combinatory fashion in one-pot. FIG. 27 shows normalized 
fluorescence emission spectra of four individual DPC reactions between indolinium/aldehyde 

15 (a) and benzoindolinium /aldehyde (b), indolinium/a,(5-unsaturated aldehyde 1 (c) and 

benzoindolinium/a,p -unsaturated aldehyde 1 (d) using above mentioned DNA codon (FIG. 
26). Four colors were generated from these DPC reactions. Similar to the fluorescence data 
from small molecule hemicyanine dyes (FIG. 25), around 1 5 nm of fluorescence emission 
wavelength difference was observed for one extra phenyl substitution (compound 13 and 14, 15 

20 and 16). One additional vinyl group in the polyene chain shifted around 80 nm of fluorescence 
emission wavelength (compound 13 and 15, 14 and 16). 

[0147] DPC reaction: Reactions were performed with 200 nM each of strands and template in 
10 mM N,N-dimethyl ethylenediamine (DMEDA), 50 mM sodium phosphate buffer, pH 8.4, 
150 mM NaCl at RT. Catalyst was added after mixing both reagents and template together in 
25 the reaction buffer. 

Example 15. Generation of two fluorescent hemicyanine dyes via end of helix DPC 

[0148] Two hemicyanine products were formed by mixing antizip3_indolinium with antizip2 
reporterl_A0 and antizip2 reporterl_Al respectively (DNA: SEQ ID NO: 69). The product 
(17) formed between antizip3_indolinium (DNA: SEQ ID NO: 70) and antizip2 reporterl AO 
30 (DNA: SEQ ID NO: 69) has excitation maximum at 540 nm and emission maximum at 600 
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nm, while the product (18) formed between antizip3_indolinium (DNA: SEQ ID NO: 70) and 
antizip2 reporterl_Al (DNA: SEQ ID NO: 69) had excitation maximum at 600 nm and 
emission maximum at 670 nm (FIG. 28). 

[0149] DPC reaction: Reactions were performed with 200 nM each of reagent in 15 mM 
5 DMEDA, 50 mM sodium phosphate buffer, pH 8.0, 2.5 mM MgC12 at 30 °C. Total reaction 
volume was 50 |iL. Catalyst DMEDA was added after mixing both reagents together in 
reaction buffer. Fluorescence was recorded immediately after the addition of catalyst DMEDA. 

GENERAL EXAMPLES ON DPC-BASED PROTEIN DETECTION 

Example 16. Creation of Fluorescence by Hybridization Induced Azidocoumarin 
10 Reduction 

[0150] Five oligonucleotides were prepared using standard phosphoramidite chemistry (Glen 
Research, Sterling VA, USA). Oligonucleotides bearing 5 '-amino groups (OHgo2 and 01igo6) 
were prepared using 5'-Amino-Modifier 5 and Oligonucleotides bearing 3'-aminogroups 
(01igo4 and OHgo5) were prepared using 3'-Amino-Modifier C7 CPG (Glen Research, Sterling 



[0151] Oligol, OHgo4 and OHgo5 were removed from the synthesis support and purified by 
reversed-phase HPLC. The amino groups of 01igo2 and OHgo6 were converted while resin- 
bound to their triphenyl phosphine derivatives and these were purified and isolated (Sakurai et 
25 a/., J. Amer. Chem. Soc, 2005, 727, ppl660-1667) to give OHgo2-TPP and OHgo-6TPP, 
respectively. 

[0152] Amino group bearing OHgo4 and OligoS were converted to their azidocoumarin 
derivatives (OHgo4-AzC and OligoS -AzC, respectively) by reaction of each oligo with the N- 
hydroxysuccinimide ester of 7-azido-4-methylcoumarin-3 -acetic acid (Thevenin et aL, Eur. J. 
30 Biochem (1992) Vol. 206, pp-47 1-477). The reaction was performed by adding 1 (aL of 

triflouroacetic acid to 5 jliL of N-methylmorpholine to prepare a buffer to which was added 10 
|iL of water containing 6.6 nmol of Oligo 4 or Oligo 5, followed by addition of 30 |iL of a 0.16 



15 VA,USA) 



Oligol 
01igo2 
OHgo4 
OligoS 



5 5 -GTGGT AGTTGGAGCTGGTGGCGTAGGC AAGA-3 ' (SEQ. ID. NO. 19) 
5'-H2N-AGCTCCAACTACCAC-3' (SEQ. ID. NO. 20) 

5 ' -GTGGT AGTTGGAGCT-NH2-3 ' (SEQ. ID. NO. 21) 

5 5 -TCTTGCCTACGCC AC-NH2-3 5 (SEQ. ID. NO. 22) 

5'-H2N-AGATCCCACTAGCAC-3' (SEQ. ID. NO. 23) 



20 OHgo6 
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M solution of the coumarin NHS-ester in dimethylformamide. Each reaction was allowed to 
proceed for 2 hours at room temperature, whereupon 50 [iL of 0.1 M aqueous 
triethylammonium acetate was added. The mixtures were applied to a NAP-5 desalting 
columns (Amersham Biosciences, Piscataway NJ USA) and eluted according to the 
5 manufacturers instructions the eluate was purified by RP-HPLC to provide 01igo4-AzC and 
OHgo5-AzC, in yields of 77% and 70%, respectively. Product identity was confirmed by 
Maldi-ToF mass spectrometry. 

[0153] To demonstrate the hybridization-specific creation of fluorescence, various 
combinations of complementary and non-complementary oligonucleotides bearing azido- 
10 coumarin and triphenyl phosphine moieties were allowed to react at room temperature in a 

buffer comprised of 30% aqueous formamide, 50 mM NaCl, and 10 mM sodium phosphate, pH 
7.2. The reaction progress was monitored over time using a Victor Multilabel fluorimeter 
(EG&G Wallach, Turku Finlnad) set to excite the sample at 360 nm and monitor light emission 
at 455 nm 

1 5 [0154] FIG. 29 shows that when OHgo4-AzC and OHgo2-TPP are combined to final 
concentrations of 200 nM and 400 nM respectively, a rapid increase in fluorescence is 
observed. In this figure 004 denotes OHgo4-AzC, 002 denote 01igo2-TPP, and 006 denotes 
OHgo6-TPP. The fluorescence does not occur when 01igo6-TPP is substituted for OHgo2-TPP. 
Whereas 01igo2-TPP is perfectly complementary in its base-pairing ability to OHgo4-AzC, 

20 01igo6-TPP is not, as it contains three mismatched nucleotides. The results support the 

conclusion that the creation of fluorescence is due to the ability of OHgo2-TPP to hybridize to 
OHgo4-AzC thus facilitating a reaction between the TPP and azidocoumarin moieties in the 
resulting hybrid. The lack of signal in the case of reaction of OHgo6-TPP with OHgo4-AzC is 
consistent with inability of these two oligonucleotides to form a duplex, therefore the reaction 

25 is not facilitated. Control reactions containing each single oligonucleotide were performed to 
rule out any non-specific effects. 

[0155] Results of additional experiments involving ternary complexes are shown in FIG, 30. 
In these experiments Oligol is tested for its ability to bring together by hybridization two 
perfectly complementary oligonucleotides (OHgo5-AzC and OHgo-2TPP) versus its ability to 
30 bring together one perfectly complementary oligonucleotide (OHgo5-AzC) and one partially- 
complementary oligonucleotide (OHgo6-TPP). Oligol and OHgo5-AzC were at 200 nM final 
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concentration, whereas OHgo2-TPP and 01igo6-TPP were employed at 400nM final 
concentration. In FIG. 30, 001 denotes Oligol, 002 denotes OHgo2-TPP, 005 denotes OHgo5- 
AzC, and 006 denotes OHgo6-TPP. The results show that fluorescence is generated only when 
the combination of fully complementary oligonucleotides is present (Oligol, OHgo5-AzC and 
5 Ohgo2-TPP). 

Example 17. Oligonucleotide Hybridization, Concentration and Melting Temperatures 

[0156] A model system was prepared which included two twenty-mer oligonucleotides with 
a ten-base complementary region and ten-base single stranded spacer arms, further linked to a 
six carbon spacer arm. These oligos were synthesized both with and without a S'-biotin (with a 
10 6-carbon spacer arm). As shown below, the complementary region is underlined. A third oligo 
was identical to the (-) strand oligo but with 4 base mismatches (italicized) to the (+) strand. 

Oligo 26 (+) strand 5' CTTCGGCC CAGATATCGT (SEQ. ID. NO. 24) 

Oligo 27 (-) strand 3' GTCTATAGCA TCGACATC (SEQ. ID. NO. 25) 

Oligo 28 (-) mismatch 3' 7L4CTATAGrG TCGACATC (SEQ. ID. NO. 26) 

15 

[0157] Melting curves of the 10-base pair oligonucleotide pair (oligo 26+oligo 27) were 
examined by measuring fluorescence of S YBR dye binding to double stranded DNA in a Bio- 
Rad iCycler (Lipsky, et al, Clinical Chemistry 2001, 47[4], 635-44). The binding curves are 
presented as the first derivative of the slope of the melting curve, such that a maximum value 
20 represents a point of inflection in the curve (a T m , or in a mixed population of double stranded 
sites, a "local" T m ). Binding curves can be obtained up to at least 70° C as avidin retains biotin 
binding activity up to this temperature and beyond. 

[0158] To check the dependence of this particular pair of oligonucleotides upon 
concentration, melting curves were generated for the oligonucleotide pair varied over the range 

25 from 500 to 20 nM (FIG. 31). (See, e.g., Lipsky, et ah, Clinical Chemistry 2001, 47[4], 635- 
44). The observed T m dropped at the rate of about 10°C per each ten- fold reduction in 
concentration (where RFU indicates relative fluorescence units) of the oligonucleotide pair, 
similar to prediction in the graph of FIG, 31. The melting curves were essentially identical for 
biotinylated and non biotinylated oligonucleotide pairs. The four base mismatched pair showed 

30 essentially no double stranded structure. 
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[0159] To test whether binding the (+) and (-) strands to a protein target would cause an 
increase in T m , the biotinylated version of these oligonucleotides were incubated in the 
presence of avidin. Avidin contains 4 equivalent binding sites, which are spaced relatively 
close together and bind to biotin very tightly (K a ~< 10" 15 M) and non-cooperatively. 

5 [0160] Presented with equal molar concentrations of oligonucleotides #26 and #27 in 

biotinylated form, it would be expected that about half of the biotin binding sites are occupied 
by complementary pairs of oligonucleotides, and about half with the same oligonucleotide 
(non-complementary pairs). The prediction is that one would observe two melting curve peaks 
in the presence of avidin. One peak would be the result of any pairs of oligonucleotides which 
10 were either not bound to avidin (free in solution) or which had only one partner of the two 
bound to avidin, which should not exhibit a proximity effect upon T m . A second peak of 
significantly higher T m would represent a pair of biotinylated oligos both bound to avidin, 
which should exhibit a proximity effect. 

[0161] Such an experiment was conducted as shown in FIG, 32. The oligonucleotides were 

15 added to a solution in the presence or absence of avidin held at 60° C, a so-called hot start. In a 
"hot start," the oligonucleotides bind to the biotin binding sites at a temperature well above 
their T m in solution, assuring that they are single stranded. The solution was then ramped down 
to 10° C and a melting curve analysis performed ascending to 70° C. As shown in FIG. 32, the 
melting curves of non-biotinylated oligo pair in the presence or absence of avidin showed a T m 

20 of 30-32° C (where RFU indicates relative fluorescence units). In the presence of avidin, 

however, two well separated T m peaks were generated with T m values of 33° C and 52° C. The 
elevated temperature peak (T m raised almost 20° C) was observed only in the presence of two 
complementary biotinylated oligonucleotides in the presence of avidin. The difference in T m 
+/- biotin tended to be greatest at lower salt concentrations (FIG. 33) and slightly higher in the 

25 presence of 10 mM magnesium chloride (FIG. 34) (where RFU indicates relative fluorescence 
units). The optimal molar ratio of biotinylated oligonucleotides to avidin was found to be about 
3.5:1, (with total concentration of oligos + avidin = 0.7 pM) consistent with avidin possessing 
four equivalent binding sites (FIG. 35) (RFU indicates relative fluorescence units). This is 
important because it substantiates that the requirement that the oligonucleotides bind to the 

30 same molecule of avidin for the T m effect to occur. The substitution of a 3' biotinylated (-) 
strand oligo for a 5' biotinylated strand oligonucleotide showed little difference in T m values 
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(FIG. 36) (RFU indicates relative fluorescence units) with previous results in which both 
oligonucleotides were 5' biotinylated. 

[0162] Results were essentially identical if the experiment was conducted by adding 
equimolar amounts of both the oligonucleotides at room temperature, ramping to 60° C, and 
5 then obtaining the melting curves. In this method (as well as the hot start method) suitable 
melting curves can be generated by adding an excess molar of each oligo relative to avidin if 
desired. (Large excesses of pairs of oligos increases the size of the low T m peak, however, as 
predicted.) This was not detrimental in forming high T m hybrid DNA since the pairs of oligos 
competed equally for biotin binding sites as long as they were added together in equal molar 

1 0 amounts. If oligos were added one at a time, it was important to add about a 2: 1 molar ratio of 
the first oligo to avidin followed by a 2:1 ratio of the second oligo. With sequential addition, 
adding an excess molar amount of either oligo relative to avidin occupies all the binding sites 
of the avidin with the first oligo and prevents occupying adjacent sites with the second, 
complementary oligo and exhibiting the elevated T m effect. These observations are consistent 

15 with the mechanism being binding of adjacent pairs of complementary oligos to two adjacent 
biotin binding sites to obtain hybrids exhibiting the elevated T m peaks. 

[0163] Experiments were also conducted with a 10-base self-complementary oligonucleotide 
which was composed entirely of A and T. (Oligo 3 1 : 5'-biotin-spacer arm- 



20 homogeneous in base composition and composed entirely of AT, it melted at a lower T m than 
the above-described model system and produced a fairly sharp melting curve. In the presence 
of avidin, its T m was increased from 30.5° C to 61.5° C (FIG, 37) (where RFU indicates relative 
fluorescence units). Since this oligonucleotide was self-complementary, all binding events lead 
to complementary strands, rather than only Vi of the events. Thus, only a single peak of 

25 increased T m was observed. 

[0164] These experiments were repeated using anti-biotin antibody as a target rather than 
avidin. Anti-biotin antibody contains two biotin binding sites located near the ends of the Fab 
portion of the antibody, but the binding sites are much further apart than the biotin binding sites 
on avidin. 




(SEQ. ID. NO. 27). Because this oligonucleotide was 
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Example 18. Detection of Protein Targets - Aptamers as Target Binders 

[0165] Here, an exemplary system was designed to utilize nucleic acid-templated 
azidocoumarin (AzC) -triphenylphosphine (TPP) chemistry to detect a protein target upon 
aptamer binding and annealing of the two complementary DNA probes. 

Materials 

[0166] Human PDGF-BB and PDGF-AA was obtained from R&D Systems (220-BB and 
220-AA, respectively). Anti-human PDGF-B Subunit monoclonal antibody was obtained from 
R&D Systems (MAB2201). Buffers included Tris/Mg buffer, at 50 mM Tris/HCl, pH 8.0-10 
mM MgCb. Oligonucleotides used were as follows: 



Oligonucleotide Sequences Used in this Example 



Oligo 

#/ 
(SEQ. 
ID#) 

201 
(28) 
202 
(29) 
203 
(30) 
204 
(31) 
205 
(32) 
206 
(33) 
207 
(34) 

208 
(35) 
209 
(36) 
210 
(37) 
211 
(38) 



Sequence (5' to 3') 

CAGGCTACGGCACGTAGAGCATCACCATG 

ATCCTG CCCCCCCCCCATATTTAAGC 

GCTTAAATATCCCCCCCCCC CAGGCTACGG 

CACGTAGAGCATCACCATGATCCTG 

GTGGGAATGGTGCCCCCCCCCC CAGGCTAC 

GGCACGTAGAGCATCACCATGATCCTG 

GTGGTAGTTGGAGTCGTGGCGTAGGCAAG 

A 

GTGGTAGTTGGAGTCACACGTGGCGTAGG 
CAAGA 

GTGGTAGTTGGAGCTCACACCACACGTGGC 
GTAGGCAAGA 

GTGGTAGTTGGAGTCACACACACCACACA 

CAGTGGCGTAGGCAAGA 

GTGGTAGTTGGAGCTCACACCACACCAACC 

ACACCACACCACACACACCACACGTGGCG 

TAGGCAAGA 

GTGTGGTGTGGTGTGGTGTG 

GTGGCGTAGGCAAGAGTGGTAGTTGGAGC 

T 

GTGGGAATGGTG 



5' - 

Mod' 

f. 



TPP 
none 
none 
none 
none 
none 
none 



none 



none 



3'- 

Mod 

'f. 



none 
AZC 
AZC 
none 
none 
none 
none 



Description 

DPC-aptamer 
probe 

DPC-aptamer 
probe 

DPC-aptamer 
probe-mismatch 

target 

target 

target 

target 



none none target 



none splint 

K-ras target 
none outward facing 



none TPP TPP probe 
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212 
(39) 
213 
(40) 
214 
(41) 
215 
(42) 



TCTTGCCTACGCCAC 

CAGGCTACGGCACGTAGAGCATCACCATG 
ATCCTG 



AGATCCCACTAGCAC 



AGCTCCAACTACCAC 



none 



none 



TPP 



TPP 



AZC 



none 



none 



none 



AZC probe 



TPP "mismatch 1 



aptamer 



TPP probe 



if 



Methods 

[0167] DPC Reaction conditions. Except as noted, each 100 microliter reaction contained, 
in a total volume of 100 jxl, 1 xTris/Mg buffer, 40 picomoles of TPP and AzC reaction probes, 
5 40 picomoles of target oligonucleotide or of target protein, and typically 25-30% v/v of 

formamide. Samples were incubated at 25° C in a Wallac Victor 1420 spectrophotometer and 
the increase in fluorescence monitored with excitation at 355 nm and emission at 460 nm. 

Results: Detection of PDGF-BB by Aptamer-DPC Probes 

10 [0168] As illustrated in FIG. 38, an aptamer sequence directed against platelet-derived 
growth factor (PDGF) B-subunits was selected (Fang, et a/., Chem. BioChem. 2003, 4, 829- 
34.). This belongs to a family of aptamers with strong affinity for PDGF B subunit (~10~ 9 M), 
and about ten-fold reduced affinity for PDGF A subunit. (Green, et al. 9 Biochemistry 35, 
14413-24. 1996) The probe sequences were synthesized, each containing a complementary 10- 

15 mer DNA sequence, a Cio spacer sequence, and the same 35-mer aptamer sequence. (Oligos 
#201, #202). Each sequence contained a 5'-TPP or 3'-AZC group with the aptamer linked 3' 
or 5', respectively. A second AzC probe, oligo #203, was the same as oligo #202 except that 
its annealing sequence was entirely mismatched to the TPP oligo (#201). 

[0169] As shown in FIG. 39, in the presence of 30% (volume) formamide, the reaction of the 
20 TPP and AzC probes with each other was entirely dependent upon the presence of PDGF-BB 
and complementary DNA sequences on the probes. The reaction failed in the absence of either 
probe. 

[0170] The DNA-dependence of the reaction was critically dependent upon the melting 
temperature of the DNA relative to the assay temperature. In the presence of 0% formamide 
25 (with the calculated and observed T m >Tassay> the reaction took place in the presence or absence 
of the target protein PDGF-BB (FIG. 40 A). In fact, under these conditions, addition of PDGF- 
BB did not increase, but reduced the reaction rate by about 50%. In 10% formamide, PDGF- 
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BB was less inhibitory (FIG. 40B). In 20% formamide (FIG. 41A), the situation was 
completely reversed - the reaction rate was now weak except in the presence of PDGF-BB. In 
30% formamide (FIG. 4 IB) the reaction was completely dependent upon the presence of 
PDGF-BB. In 40% formamide, the reaction was very slow with any set of reactants (FIG. 42). 
5 In all cases, the mismatched probes produced little or no reaction. 

[0171] DNA melting experiments with the complementary sequences, monitored with SYBR 
Green had indicated a T m of the sequence of about 30° C in the Tris/Mg buffer in the absence of 
formamide, and about 7° C lower for every 10% increase in formamide. T m in the optimal 
formamide concentration for the detection assay, 30%, was 10° C. 

10 [0172] In 0% formamide, the oligonucleotides can form at least a partial duplex even in the 
absence of PDGF-BB (T m slightly higher than Tassay). The DNA target-dependence of the 
reactions in 20% and 30% formamide is explained by the assay being conducted at a 
temperature greater than the T m in the absence of protein target. No reaction occurs unless the 
T m of the complex is increased by the binding of the two probes to the PDGF-BB target. At 

15 40% formamide, the reaction doesn't occur with any set of reactions. The likely explanation is 
that either the T m had been reduced so low that binding to PDGF-BB could not raise it above 
Tassay, or that formamide had inhibited PDGF-BB binding to the aptamers. A more complex 
situation is the observed inhibition of reaction rate upon addition of PDGF-BB in the absence of 
formamide. Since half of the duplexes formed by PDGF-BB are non-productive (50% will be 

20 homoduplexes) the reduction in rate is likely due to PDGF-BB binding preventing these 

homoduplexes from disassociating and then reassociating in solution with complementary pairs 
to form heteroduplexes. This situation should not occur using pairs of probes specifically 
directed against different binding sites in a heterodimeric target. 

[0173] The sensitivity of the assay (FIG. 43) was calculated by measuring reaction rates 
25 generated from a dilution series of PDGF-BB concentrations. The minimum detection level on 
the Wallac instrument was estimated at 0.8 picomoles in a 100 microliter assay volume, based 
upon the calculated value of three times the standard deviation of the background noise of the 
assay. 

[0174] The assay sensitivity was also determined using PDGF-AA as a target. The aptamer 
30 monomer is expected to have an affinity for PDGF-AA about ten times weaker than for PDGF- 
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BB. However, since the assay involves forming a complex of two aptamer-dimers to either 
type of PDGF, the avidity of binding of the dimer is expected to be tighter than the affinity of 
the monomer, and its affinity should be substantially tighter (lower Ki) than the concentrations 
tested of the target PDGFs (down to about 1 nanomolar). As shown in FIG. 44, the reaction 
5 rates of the aptamer DPC probes to PDGF-AA at low or high concentrations (0, 1 .25, 2.5, 5, 10, 
20, and 40 pmole of PDGF-AA) were not substantially different than the reaction rates with 
PDGF-BB. This is consistent with the model of an aptamer pair binding as a dimer and 
exhibiting increased avidity. 

[0175] Ratios ofTPP to AzC Probes. To confirm the model of the reaction mechanism 
10 (FIG, 8, the optimal ratio of TPP to AzC probes would be expected to be 1:1), FIG. 45 was an 
experiment in which the total amount of the two probes was kept constant, at 800 nMoles 
probes/reaction, while the ratio of the two probes was varied. The ratio producing the highest 
reaction rate was approximately 1:1, consistent with the expected mechanism. 

[0176] Thus, in this model system fluorescence was not generated unless the aptamers bound 
1 5 and the complementary sequences in the two probes annealed to each other. 

Example 19. Zip-Coded Architecture for Nucleic Acid-templated Chemistry Based- 
Biodetection with Aptamer Binders 

[0177] FIG. 14 [15x] illustrates in more detail an exemplary zip-code architect. The TPP 
pair contained, first, a PDGF-aptamer on the 5' -end, a CI 8 polyethylene-glycol based spacer, 

20 and an 18-mer zip code sequence. The TPP reporter sequence contained a complementary anti- 
zip code sequence on its 3' terminus, a C18 PEG spacer, and a ten base pair reporter sequence 
terminating in a 5' TPP group. The pair of oligonucleotides comprising the AzC detection 
probe contained a 3' -aptamer linked through a CI 8 PEG spacer to a separate zip code, and a 
detection oligonucleotide linked to a 5' anti-zip code, a CI 8 PEG spacer, and a reporter 

25 oligonucleotide (complementary to the TPP oligonucleotide) terminating in a 3' AzC group. 

[0178] The reaction, in 35% formamide at 22°C, was dependent upon the presence of both of 
the reporter oligonucleotides, both of the aptamer oligonucleotides, and the target, PDGF-BB 
(FIG. 46). At 22°C in the absence of formamide, the reaction proceeded independently of the 
presence of PDGF. This is consistent with the behavior of the above-described "one-piece" 
30 architech, and reflects that the mechanism of fluorescence generation in 35% formamide is 
dependent the increased thermal stability of the reporter sequence duplex in formamide upon 



WO 2008/054600 



PCT/US2007/021094 



-46- 

addition of PDGF. In the absence of formamide at 22°C, the reporter oligonucleotide duplex is 
stable both in the presence and absence of PDGF. 

[0179] Confirmation of the correctness of the model was obtained with experiments varying 
the ratio of the TPP and AzC aptamer oligos (FIG, 47). These experiments indicated that the 
5 optimal ratio of the aptamer oligos was the expected 1 : 1 ratio (i.e. 50% TPP oligo with a total 
concentration of PDGF and aptamer oligos of 0.4 jjM). The optimal ratio of total reporter 
oligonucleotides to total aptamer oligos was also 1:1. No PDGF-dependent reaction occurred in 
the complete absence of either one of the reporter or aptamer oligonucleotides. At higher than 
stoicheometric concentrations of reporter oligonucleotides, the PDGF-independent signal 
10 increased (background) but the PDGF-dependent signal remained about constant. Both of these 
observations are consistent with the model that the complex is assembled in the ratio of 1 : 1 : 1 
for each of the aptamer oligos, each of the reporter oligos, and PDGF. 

[0180] These experiments indicate that the complex can self-assemble in solution, such that 
each zip code and its anti-zip code anneal to each other with minimal interference with the 
1 5 aptamer sequence or the reporter sequences. 

[0181] Experiments were also conducted to determine if the order of addition, and thus 
assembly of the aptamer and reporter probes, was of any importance. Slightly slower reaction 
rates were obtained if the aptamer oligonucleotides were first incubated with PDGF before 
adding the reporter oligonucleotides, compared with adding all probes together as a mixture. 
20 Somewhat greater reaction rates were obtained if each pair of aptamer oligonucleotides and 
reporter oligonucleotides was first incubated and allowed to assemble with each other before 
the two sets were mixed together and incubated with PDGF. The reason for this may be that 
there is some steric hindrance to zip code-anti zip code annealing to aptamer probe if the 
aptamer probe is already bound to target. 

25 [0182] As a control, a set of one-piece TPP and AzC probes was compared which contained 
only the zip code sequences and no zip code-anti zip code sequences (FIG. 48). The reaction 
rates of this one-piece system were similar to that of the two-piece system, except that the rate 
enhancement due to the addition of PDGF was typically slightly better than that of the two- 
piece system. 
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[0183] The sequence of the aptamer-containing TPP and AzC probes was also systematically 
varied to determine any constraints on the design. The aptamer-containing TPP and AzC oligos 
were synthesized, both having the same sequences as described in FIG. 14 but with the 
following changes: (1) omission of the C18-PEG spacer. (Oligos 1 19 & 122); (2) replacement 
5 of the C18-PEG spacer with the sequence Cio- (oligos 120 & 123); (3) replacement of the CIS- 
PEG spacer with the sequence C 2 o. (oligos 121&124); (4) Omission of the C18-PEG spacer and 
omitting 3 3 '-bases in the zip code region (reduction to 15 bases in length), (oligos 127 & 129); 
and (5) omission of the CI 8-PEG spacer and omitting 6 3 '-bases in the zip code region 
(reduction to 12 bases in length), (oligos 128 & 130). 

10 [0184] Oligonucleotides used in this example included: 

Qligo#/ Sequence (5'-3*) Modification 

(SEQ. ID NO. 43) 

106 GGACTCGAGCACCAATAC-X-TATAAATTCG-AZC X= CI 8 PEG; AZC = 3'- AzC. 
(SEQ. ID NO. 44) 

15 109 CGAATTTATA-X-CTGACCATCGATGGCAGC X=C18 PEG, 5'-TPP 

(SEQ. ID NO. 45) 

1 1 2 CAGGCTACGGCACGTAGAGC ATCACC ATG ATCCTG-X-GCTGCCATCG ATGGTCAG X= C 1 8 
PEG 

(SEQ. ID NO. 46) 

20 113 GTATTGGTGCTCGAGTCC-X-CAGGCTACGGCACGTAGAGC ATCACC ATG ATCCTG X= CI 8 PEG 
(SEQ. ID NO. 47) 

1 19 GTATTGGTGCTCGAGTCCCAGGCTACGGCACGTAGAGCATCACCATG ATCCTG 
(SEQ. ID NO. 48) 

120 GTATTGGTGCTCGAGTCCCCCCCCCCCCCAGGCTACGGCACGTAGAGCATCACCATGATCCTG 
25 (SEQ. ID NO. 49) 

121 

GTATTGGTGCTCGAGTCCCCCCCCCCCCCCCCCCCCCCCAGGCTACGGCACGTAGAGCATCACCATGA 
TCCTG 

(SEQ. ID NO. 50) 

30 122 CAGGCTACGGCACGTAGAGC ATCACC ATG ATCCTG GCTGCCATCG ATGGTCAG 
(SEQ. ID NO. 51) 

123 CAGGCTACGGCACGTAGAGCATCACCATGATCCTGCCCCCCCCCCGCTGCCATCGATGGTCAG 
(SEQ. ID NO. 52) 
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124 

CAGGCTACGGCACGTAGAGCATCACCATGATCCTGCCCCCCCCCCCCCCCCCCCCGCTGCCATCGATG 
GTCAG 

(SEQ. ID NO. 53) 

5 1 27 CAGGCTACGGC ACGTAG AGCATC ACC ATG ATCCTGGCTGCC ATCG ATGGT 
(SEQ. ID NO. 54) 

128 CAGGCTACGGCACGTAGAGCATCACCATGATCCTGGCTGCC ATCG AT 
(SEQ. ID NO. 55) 

129 TTGGTGCTCGAGTCCCAGGCTACGGC ACGTAG AGCATC ACC ATG ATCCTG 
10 (SEQ. ID NO. 56) 

1 30 GTGCTCG AGTCCC AGGCTACGGCACGTAG AGCATC ACC ATG ATCCTG 

[0185] None of these changes resulted in a significant difference in the performance of the 
system. Experiments 4) and 5) also resulted in a 3 and 6-base single stranded (not annealed to 
zip code) structure immediately upstream of the CI 8 spacer in the reporter oligonucleotides. 

15 [0186] The results of these experiments indicate that the aptamer-based PDGF detection 
system can be assembled separating the binding and DPC functions into two separate 
oligonucleotides. Through the selection of appropriate zip code sequences, the detection format 
described in FIG. 13 self-assembled into pairs of annealed oligonucleotides which will function 
similarly to oligonucleotides synthesized in a single piece. The reporter and aptamer 

20 oligonucleotides may be separately assembled prior to introduction of target, or all species may 
be added together in almost any order. This process may be extended to the solution-phase 
assembly of more than one pair of annealed detection oligos, for example, to detect multiple 
targets. Detection of multiple targets may require using different reporter oligonucleotides 
which generate separately discernable signals (for example, different wavelengths of emitted 

25 light). 

[0187] These results indicate that a zip-coded reporting approach can be effectively 
designed, for example, using aptamer-containing oligonucleotides. 

[0188] While the results with the aptamer system indicate that a stable complex between 
binding and reporter sequences can be formed simply by annealing the zip code and anti-zip 
30 code regions, it should be noted that there are technologies to covalently and irreversibly link 
the two oligonucleotides together, with a high likelihood of retaining activity of the reporter 
reactive groups. For example, the oligonucleotides may be incubated in pairs (a binder 
oligonucleotide and a reactive oligonucleotide for nucleic acid-template chemistry) at a 
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temperature at which the zip codes and anti-zip codes are mostly double stranded, but the rest 
of the sequences are single-stranded. Adding an intercalating, photoactivatable cross-linker 
such as Trioxalen, followed by UV irradiation, may irreversibly crosslink the two strands. 
Similarly, UV irradiation may introduce thymidine dimers between separate strands of annealed 
5 sequences. Alternately, a sequence may be introduced complementary to a short target (splice) 
DNA, abutting 3' and 5', which may then be ligated with DNA ligase. The splice 
oligonucleotide may alternately be composed of RNA, and removed after ligation with RNase 
H, which hydrolyzes RNA annealed to DNA. This can result in converting the two 
oligonucleotides into a single piece of single-stranded DNA. These methods can lead to cost- 
10 effective production of oligonucleotide reagents in detection kits against specific targets. 

[0189] Relevant references for this example include Capaldi, et aL, Nucleic Acid Res. 2000, 
28[7], e21.; Castiglioni, et aL, Appl. and Exper. Microbio. 2004, 7161-72; Fang, et aL, 
Chem.BioChem. 2003, 4, 829-34.; Gerry, et aL, J. Mol. Biol. 1999, 292, 251-62. 

Example 20. Zip-Coded Architecture for DPC-based Biodetection - Antibody Binders 
15 [0190] In another embodiment, the aptamer sequences are replaced with non-DNA binders 
such as antibodies. For PDGF and other protein targets, the aptamer sequences are replaced 
with chemically active groups, such as aldehydes, and reacted with non-DNA binder sequences 
such as antibodies or receptors to the protein targets (FIG. 16). The optimal design for the 
binder and reporter oligonucleotides may be achieved with considerations on the size and 
20 geometry of the binder and size and geometry of the binding sites of the target. A longer, or 
shorter spacer arms, for example, may be used to optimally span the distance between binding 
sites on the target and avoid steric hindrance due to the binders themselves. 

[0191] Referring to FIG, 16, the zip-coded oligonucleotide designed to hybridize to the TPP 
reporter molecule was synthesized containing a 5 '-amino group. The zip-coded oligonucleotide 
25 designed to hybridize to the AzC reporter molecule contained a 3 '-amino group. Synthesis of 
the conjugates between the oligonucleotides and anti-PDGF-BB antibody were performed by 
SoluLink Biosciences (San Diego, CA). 

[0192] The SoluLink technology for conjugation of the antibody and oligonucleotides first 
requires modification of the primary amino groups of the antibody with succinimidyl 2- 
30 hydrazinonicotinate acetone hydrazone) to incorporate an acetone hydrazone onto the antibody. 
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The primary amines of the oligonucleotides are separately activated with succinimimdyl 4- 
formylbenzoate. The two activated molecules are mixed in the desired ratio (typically 6:1) and 
reacted at a mildly acidic pH to form a stable hydrazone linkage. The details of this chemistry 
are available at www.SoluLink.com. Two conjugates were prepared: one containing the zip 
5 code to anneal to the AzC -containing reporter oligonucleotide, and the other containing the zip 
code to anneal to the TPP-containing reporter oligonucleotide. 

[0193] The antibody-oligonucleotide conjugates received from SoluLink were further 
purified by gel chromatography on a 1.6 x 60 cm column of Superdex S-200 (Amersham 
Biosciences) in PBS buffer (0.01 M potassium phosphate, pH 7.4 - 0.138 M sodium chloride). 

10 The main antibody peak, eluting at about 0.6 times the column volume, was collected and a 
later eluting peak of contaminating non-conjugated oligonucleotide was discarded. The 
antibody conjugate was concentrated by reversed dialysis with a Pierce (Rockford, IL) 30 K 
molecular weight cut-off Slide-A-Lyzer using Pierce Concentrating Solution. The protein 
content was determined using the Bio-Rad Micro BCA Reagent Kit and the oligonucleotide 

15 content determined using SYBR Gold DNA binding dye (Molecular Probes (Eugene, OR). The 
conjugates were both determined to contain an average of approximately 3 oligonucleotides per 
protein molecule. 

[0194] Recombinant human PDGF-BB (220-BB) and mouse monoclonal anti-PDGF-BB 
(MAB220) were obtained from R&D Systems (Minneapolis MN). 

20 [0195] Sequences used in this study included (where AzC indicates azidocoumarin and TPP 
indicates triphenylphosphine): 

Name Sequence (5'-3') 

TPP reporter TPP-(amino modifier C6)-CGAATTTATA-C 1 8PEG-TCAGCATCGTACCTCAGC 
25 (SEQ ID NO.: 9) (SEQ ID NO.: 58) 

AzC reporter GGACTCGAGCACCAATAC-C 1 8 PEG-TATAAATTCG-(amino modifier C7)-AzC 

(SEQ ID NO.: 14) (SEQ ID NO.: 10) 

AzC zip code TTGGTGCTCGAGTCCCCCCCCCCCCCCCCCCCCCC-(amino modifier C7) 

(SEQ ID NO.: 59) 

30 TPP zip code (amino modifier C6)-CCCCCCCCCCCCCCCCCCCCGCTGAGGTACGATGCTGA 

(SEQ ID NO.: 60) 
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[0196] In addition, the 5' amino modifier C6 was obtained from Glen Research (from Glen 
Research phosphoramidite 1 10-1906). The 3 '-amino modifier C7 was obtained from Glen 
Research (from Glen Research CPG 20-2957). The CI 8 PEG was obtained from Glen 
Research (from Glen Research phosphoramidite 10-1918). 

5 Assembly of antibody-oligo conjugates with reporter oligonucleotides. 

[0197] The two antibody-oligo conjugates with their reporter were first assembled separately 
in a volume of 10 \xL Each assembly contained 0.5 (5 picomoles) of antibody- 
oligonucleotide conjugate and 0.15 jaM of (15 pmoles) of complementary reporter 
oligonucleotide in 0.05 M Tris/HCl pH 8 - 0.01 M magnesium chloride. Each was incubated 
10 for at least 15 minutes at 4° C before use in the detection reaction mixture. 

Detection Reaction of anti-PDGF-BB DPC Conjugates/Reporters with PDGF-BB 

[0198] To conduct detection reaction, each reaction may contain in a volume of 50 jil: 10 p.1 
of each conjugate assembly, prepared as described above, and variable amounts of PDGF-BB, 
in a buffer of 0.05 M Tris/HCl pH 8 - 0.01 M magnesium chloride-40% volume/volume 
15 formamide. The conjugates are present in this reaction mixture at 0.2 |iM. Samples are 

incubated in the wells of a black 96-well microplate in a Wallac Victor Luminometer at 25° C. 
Fluorescence can be followed vs. time with excitation at 355 nm and emission at 460 nm. 

[0199] Reactions typically may be carried out at 25° C, monitoring fluorescence generation 
at the wavelength optimums of the reaction product, 7-amino coumarin. 



20 INCORPORATION BY REFERENCE 

[0200] The entire disclosure of each of the publications and patent documents referred to 
herein is incorporated by reference in its entirety for all purposes to the same extent as if each 
individual publication or patent document were so individually denoted. 

EQUIVALENTS 

25 [0201] The invention may be embodied in other specific forms without departing form the 
spirit or essential characteristics thereof. The foregoing embodiments are therefore to be 
considered in all respects illustrative rather than limiting on the invention described herein. 
Scope of the invention is thus indicated by the appended claims rather than by the foregoing 
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description, and all changes that come within the meaning and range of equivalency of the 
claims are intended to be embraced therein. 

[0202] What is claimed is: 
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CLAIMS 

1 1 . A method for making a polymethine dye comprising conducting an aldol condensation 

2 between an aldehyde and an active hydrogen component in an aqueous condition in the 

3 presence of an organocatalyst. 

1 2. The method of Claim 1 wherein the condensation reaction is: 




3 wherein 

Z' = O, S, Se, P, NH 2 , NR, C(CH 3 ) 2 where R is alkyl group 
n = 0, 1,2... 
R = H, alkyl 

R" = H, alkyl, alkyl carboxylic acid 

4 R' = Ph or A/-heterocycle, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , NH 2l N(R) 2 , OR where R is alkyl group 

5 

6 and wherein the organocatalyst is a secondary amine, a primary amine, or a bifunctional 

7 amine-acid catalyst. 

1 3. The method of Claim 2 wherein the secondary amine is a pyrrolidine, a piperidine, a 

2 nornicotine, or an analog thereof. 

1 4. The method of Claim 2 wherein the primary amine is a valine or a peptide having fewer 

2 than 3 amino acid units. 

1 5. The method of Claim 2 wherein the bifunctional amine-acid catalyst is 

2 pyrrolidine/AcOH. 

1 6. The method of Claim 2 wherein the organocatalyst is pyrrolidnylmethyl-pyrrolidine, 

2 aminomethyl pyrrolidine, dimethylethane-l,2-diamine, propane- 1,2-diamine, 1 -(2-aminoethyl)- 

3 piperidine, or diethylethylene-1 ,2-diamine. 
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1 7. A hemicyanine dye having the chemical structure of (I) prepared by the method of 

2 Claim 2. 



3 




4 Wherein 



Z = O, S, Se, P, NH 2 , NR, C(CH 3 ) 2 where R is alkyl group 
□ = 0,1,2... 
R = H, alkyl 

R" = H, alkyl, alkyl carboxylic acid 
5 R' = Ph or /V-heterocycle, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , NH 2 , N(R) 2 , OR where R is alkyl group 



1 8. A hemicyanine dye having the chemical structure of II or III: 




n1 = 1,2, 3,4,5 
n2, n3 = 0to 16 
R1 = alkyl 

R2 = Ph, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , 
NH 2 , N(R 1 ) 2 , ORt while R, is alkyl group 
R 3 = Ph, H, alkyl, SO3H, OH, CN, CI, Br, N0 2 , 
2 NH 2 , N(R 1 ) 2 , ORt while is alkyl group 



n1 = 1, 2, 3, 4, 5 
n2, n3 = 0to 16 
R1 = alkyl 

R2 = Ph, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , 
NH 2 , N(R 1 ) 2 , ORt while R A is alkyl group 
R 3 = Ph, H, alkyl, S0 3 H, OH, CN, CI, Br, N0 2 , 
NH 2 , N(R 1 ) 2 , ORt while R, is alkyl group 



3 



(II) 
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1 9. 



2 
3 

1 



2 
3 

1 



An aldehyde having the chemical structure of IV or V: 

^OH 



OHC 



R 2 

ni 




OHC 



n1 = 1,2, 3, 4, 5 
n2 = Oto 16 
R1 = alkyl 

R 2 = Ph, H, alkyl, S0 3 H, OH, CN, CI, Br, 
N0 2 , NH 2 , N(R 1 ) 2 , OR-) while R 1 is alkyl 
group 



(IV) 



n1 =1,2, 3, 4,5 
n2 = 0 to 16 
R1 = alkyl 

R 2 = Ph, H, alkyl, S0 3 H, OH, CN, CI, Br, 
N0 2 , NH 2 , N(R-,) 2 , ORi while Ri is alkyl 
group 



(V) 



10. 



A quaternary salt having the chemical structure of VI or VII: 

R. 




\ OH 



n = 0to16 

R = H, alkyl, S0 3 H, OH, 
CN, CI, Br, N0 2 , NH 2 , 
N(Ri>2. ORi while R n is 
alkyl group 



(VI) 




n = 0to 16 

R = H, alkyl, S0 3 H, OH, 
CN, CI, Br, N0 2 , NH 2 , 

N ( R i)2. 0R 1 whi,e R 1 is 
alkyl group 



(VII) 



11. 



A quaternary salt-nucleic acid conjugate having the chemical structure of: 

1 _r 



< 

R— 



A 



nucleic acid 



n = 0to16 

Z1 = O, S, Se, P, NH 2 , NR-,, C(CH 3 ) 2 where is alkyl group 
R = any substituted benzyl or higher fused benzyl rings, H, 
alkyl, S0 3 H ( OH, CN, CI, Br, N0 2) NH 2 , N(R 1 ) 2) ORi while R A 
is alkyl group 

Z 2 = benzene or any N-heterocycles 
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1 12. An aldehyde-nucleic acid conjugate having the chemical structure of: 



2 
1 



2 

1 

2 
3 

1 

2 

1 

2 

1 

2 

3 
4 

5 
6 



R 2 



OHCJ 




ni 



nucleic acid 



n1 = 1,2, 3, 4,5 
n2 = 0to16 
R1 = H, alkyl 

R 2 = Ph or N-heterocycle, H, alkyl, S0 3 H, 
OH, CN, CI, Br, N0 2 , NH 2 , N^fe, OR, 
while R<i is alkyl group 

13. A hemicyanine dye-nucleic acid conjugate having the chemical structure of: 



nucleic acid N 




N 



\ J nucleic acid 

N 

H 



n1 =1,2, 3, 4,5 
n2 and n3 = 0 to 16 
Z1 = O, S, Se, P, NH 2 , NR 1} C(CH 3 ) 2 
where R^ is alkyl group 
R 3 = any substituted benzyl or higher 
fused benzyl rings, H, alkyl, S0 3 H, OH, 
CN, CI, Br, N0 2 , NH 2 , N(R 1 ) 2 , OR n while 
R 1 is alkyl group 

Z 2 = benzene or any N-heterocycIes 
R1 = H, alkyl 
R 2 = Ph or N-heterocycle, H, alkyl, S0 3 H, 
OH, CN, CI, Br, N0 2 , NH 2 , N(R 1 ) 2 , OR! 
while is alkyl group 

14. A method for making a hemicyanine-nucleic acid conjugate comprising conducting a 
nucleic acid-templated reaction between an aldehyde of Claim 12 and quaternary salt of Claim 
1 1 to make a hemicyanine of Claim 13. 

15. The method of Claim 14 wherein the nucleic acid-templated reaction is in an end of 
helix architect. 

16. The method of Claim 14 wherein wherein the nucleic acid-templated reaction is in a 
middle of helix architect. 

17. A method for selecting a dye having a desired fluorescent property, the method 
comprising: 

(a) preparing a library of oligonucleotide-encoded dyes through nucleic acid- 
templated synthesis; 

(b) hybridizing the oligonucleotide-encoded dyes with spatially arrayed 
complementary oligonucleotide probes immobilized on a solid support; 



WO 2008/054600 



PCT/US2007/021094 



-57- 

7 (c) measuring the absorption and fluorescence properties of the oligonucleotide- 

8 encoded dye directly on the solid support. 

9 (d) identifying the oligonucleotides that encode the dyes having the desired 

1 0 fluorescence properties based on the position of the immobilized complementary 

1 1 oligonucleotide probes, and 

12 (e) identifying and characterizing the chemical structure of the dyes having the 

1 3 desired fluorescence property. 

1 18. A method for detecting multiple target nucleotide sequences, the method comprising: 

2 (a) providing a number of probe pairs, the number equal to the number of target 

3 nucleotide sequences, wherein each probe pair comprises (1) a first probe comprising (i) a first 

4 oligonucleotide sequence and (ii) a first reactive group linked to the first oligonucleotide 

5 sequence, and (2) a corresponding second probe comprising (i) a second oligonucleotide 

6 sequence and (ii) a second reactive group linked to the second oligonucleotide sequence, 

7 wherein the first oligonucleotide sequence and the second oligonucleotide sequence are 

8 complementary to two separate regions of a corresponding target nucleotide sequence; 

9 (b) combining the probe pairs with a sample to be tested for the presence of the 

1 0 target nucleotide sequences under conditions where the first probes and the second probes 

1 1 hybridize to their respective complementary regions of the target nucleotide sequences if 

12 present in the sample thereby bringing into reactive proximity the first reactive groups and the 

13 corresponding second reactive groups; and 

14 (c) detecting one or more reactions between the first reactive groups and the 

1 5 corresponding second reactive groups thereby determining the presence of the target nucleotide 

16 sequences. 

1 19. The method of Claim 1 8 wherein the number of target nucleotide sequences is 2 to 20. 

1 20. The method of Claim 1 8 wherein the number of target nucleotide sequences is 2 to 6. 

1 21 . The method of Claim 18 wherein the target nucleotide sequences are in solution phase. 

1 22. The method of Claim 1 8 wherein the target nucleotide sequences are attached to a solid 

2 support. 
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1 23. The method of Claim 1 8 wherein the one or more reactions between the first reactive 

2 groups and the corresponding second reactive groups generate fluorescent compounds that may 

3 be detected. 

1 24. The method of Claim 1 8 wherein the one or more reactions between the first reactive 

2 groups and the corresponding second reactive groups generate chemiluminescent compounds 

3 that may be detected. 

1 25. The method of Claim 1 8 wherein the one or more reactions between the first reactive 

2 groups and the corresponding second reactive groups comprise an aldol condensation reaction. 

1 26. The method of Claim 18 wherein the one or more reactions between the first reactive 

2 groups and the corresponding second reactive groups comprise a Wittig reaction. 
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